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Preface 


This quarterly publication provides archival reports on developments in programs 
managed by JPL’s Office of Telecommunications and Data Acquisition (TDA). In space 
communications, radio navigation, radio science, and ground-based radio and radar astron- 
omy, it reports on activities of the Deep Space Network (DSN) and its associated Ground 
Communications Facility (GCF) in planning, in supporting research and technology, in 
implementation, and in operations. Also included is TDA-funded activity at JPL on data 
and information systems and reimbursable DSN work performed for other space agen- 
cies through NASA. The preceding work is all performed for NASA’s Office of Space 
Operations (OSO). The TDA Office also performs work funded by two other NASA 
program offices through and with the cooperation of the Office of Space Operations. 
These are the Orbital Debris Radar Program (with the Office of Space Station) and 21st 
Century Communication Studies (with the Office of Exploration). 

In the search for extraterrestrial intelligence (SETI), the TDA Progress Report reports 
on implementation and operations for searching the microwave spectrum. In solar system 
radar, it reports on the uses of the Goldstone Solar System Radar for scientific explora- 
tion of the planets, their rings and satellites, asteroids, and comets. In radio astronomy, 
the areas of support include spectroscopy, very long baseline interferometry, and astrom- 
etry. These three programs are performed for NASA’s Office of Space Science and 
Applications (OSSA), with support by the Office of Space Operations for the station 
support time. 

Finally, tasks funded under the JPL Director’s Discretionary Fund and the Caltech 
President’s Fund which involve the TDA Office are included. 

This and each succeeding issue of the TDA Progress Report will present material in 
some, but not necessarily all, of the following categories: 

OSO Tasks: 

DSN Advanced Systems 

Tracking and Ground-Based Navigation 
Communications, Spacecraft-Ground 
Station Control and System Technology 
Network Data Processing and Productivity 
DSN Systems Implementation 
Capabilities for Existing Projects 
Capabilities for New Projects 
New Initiatives 

Network Upgrade and Sustaining 
DSN Operations 

Network Operations and Operations Support 
Mission Interface and Support 
TDA Program Management and Analysis 
Communications Implementation and Operations 
Data and Information Systems 
Flight-Ground Advanced Engineering 

OSO Cooperative Tasks: 

Orbital Debris Radar Program 
21st Century Communication Studies 



OSSA Tasks: 

Search for Extraterrestrial Intelligence 
Goldstone Solar System Radar 
Radio Astronomy 


Discretionary Funded Tasks 
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Deriving a Geocentric Reference Frame for 
Satellite Positioning and Navigation 

R. P. Malla+ 

S.-C. Wu 

Tracking Systems and Application Section 


With the advent of Earth-orbiting goedetic satellites, nongeocentric datums or reference 
frames have become things of the past. Accurate geocentric three-dimensional positioning 
is now possible and is of great importance for various geodetic and oceanographic applica- 
tions. While relative positioning accuracy of a few centimeters has become a reality using 
very-long-baseline interferometry (VLB1), the uncertainty in the offset of the adopted 
coordinate system origin from the geocenter is still believed to be on the order of 1 meter. 
Satellite laser ranging (SLR), however , is capable of determining this offset to better than 
10 cm, but this is possible only after years of measurements. Global Positioning System 
(GPS) measurements provide a powerful tool for an accurate determination of this origin 
offset. Two strategies are discussed in this article. The first strategy utilizes the precise 
relative positions that have been predetermined by VLB1 to fix the frame orientation and 
the absolute scaling, while the offset from the geocenter is determined from GPS measure- 
ments. Three different cases are presented under this strategy. The reference frame thus 
adopted will be consistent with the VLBI coordinate system. The second strategy estab- 
lishes a reference frame by holding only the longitude of one of the tracking sites fixed. 
The absolute scaling is determined by the adopted gravitational constant (GM) of Earth; 
and the latitude is inferred from the time signature of Earth rotation in the GPS measure- 
ments. The coordinate system thus defined will be a geocentric Earth-fixed coordinate 
system . A covariance analysis shows that geocentric positioning to an accuracy of a few 
centimeters can be achieved with just one day of precise GPS pseudorange and carrier 
phase data. 


I. Introduction 

The fully operational Global Positioning System (GPS) will 
consist of at least 1 8 satellites distributed in six orbital planes 
[1] . This system Will allow a user, anywhere on the Earth or 
in a low Earth orbit, to view at least five satellites most of the 
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time. Two precision data types can be derived from the GPS 
transmitted signals: P-code pseudorange and carrier phase at 
two L-band frequencies [2] . These precision data types pro- 
vide the opportunity to produce geodetic measurements accu- 
rate to the centimeter level [3] and orbit determination of low 
Earth orbiters to the subdecimeter level [4] . The ephemerides 
for the GPS satellites, as distributed by Naval Surface Weapon 
Center (NSWC), are based upon the World Geodetic System 
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(WGS 84) [5] , and their accuracy is on the order of 10 meters 
[6] . In applications where high precision is essential, the GPS 
satellite orbits need to be adjusted to a much higher precision, 
along with all the other parameters in the network [3] , [4] . 
The GPS satellites can be simultaneously observed from several 
sites in a geodetic network. Within such a network a few fidu- 
cial tracking sites are included [7] . The relative positions of 
these fiducial sites are known to a higher level of precision, 
typically a few centimeters, as a result of repeated measure- 
ments of the baselines using very-long-baseline interferometry 
(VLBI) [8] . Based upon these highly precise relative positions 
of the fiducial sites, filter strategies can be designed to adjust 
the satellite orbits to enhance their accuracy to far better than 
10 meters [9]. The ephemerides thus adjusted now refer to 
the same coordinate frame in which the fiducial baselines are 
known. It is generally believed that the best VLBI coordinate 
system origin approximates the geocenter to about 1 meter. 
The satellite laser ranging (SLR) technique is capable of realiz- 
ing the geocenter offset to better than 10 cm, but this is possi- 
ble only after years of observations. 

Although absolute positioning is of less interest for geody- 
namic applications, it can be an important factor when track- 
ing deep space vehicles, and it is essential for orbit determina- 
tion of Earth-observing satellites, such as NASA’s Ocean 
Topography Experiment (TOPEX), to be launched in late 
1991 [10]. This article investigates two strategies for pre- 
cise determination of the geocenter, thus establishing a geo- 
centric coordinate frame for GPS measurements. In the first 
strategy, GPS P-code pseudorange and carrier phase measure- 
ments are made from a set of globally distributed tracking 
stations. A network consisting of six stations appears to be 
appropriate. Of these, three are the fiducial sites whose relative 
location has been well determined by VLBI. Since it is the 
relative location, rather than the absolute location, of the 
fiducial sites that is well determined by VLBI, only baseline 
coordinates should be fixed to define the orientation and 
absolute scaling of the reference frame. The geocenter posi- 
tion and the coordinates of other, nonfiducial sites are to be 
adjusted together with the GPS orbits. The coordinate frame 
thus defined is consistent with the VLBI frame, with improved 
geocenter offset. Three different cases are discussed under 
this strategy. 

An alternate strategy is to simultaneously adjust the GPS 
orbits and geodetic station coordinates with respect to one 
reference site in the network whose longitude is held fixed. 
The absolute scaling is determined by the adopted gravita- 
tional constant GM of Earth; the station heights are inferred 
from the adjusted periods of GPS orbits and the pseudorange 
measurements; and the latitude is inferred from the time 
signature of Earth rotation in the GPS measurements. The 
coordinate system thus defined will be an Earth-centered, 


Earth-fixed (ECEF) coordinate frame. The solution is free 
from any a priori uncertainty of site positions, and the inferred 
reference frame is strictly self-contained. This type of tech- 
nique has been adopted by the satellite laser ranging (SLR) 
and lunar laser ranging (LLR) communities [11]. The coor- 
dinate origin offset from the geocenter is given by the weighted 
mean coordinate offsets of all stations in the network. 

A covariance analysis was performed estimating the accuracy 
with which the geocenter position can be determined using the 
two strategies. This analysis indicates that the geocenter posi- 
tion can be determined to an accuracy of a few centimeters 
with just one day of precision pseudorange and carrier phase 
data. Such precise knowledge of absolute position of the coor- 
dinate system origin is essential to the orbit determination of 
TOPEX, which requires an altitude accuracy of 13 cm or better. 

II. Coordinate Reference Frame 

A rectangular coordinate system is defined such that the 
Z axis coincides with the mean spin axis of the Earth as de- 
fined by the CIO pole; the X axis lies in the mean equatorial 
plane, which is perpendicular to the Z axis, and passes through 
the mean Greenwich astronomic meridian as defined by the 
BIH; the Y axis completes the right-handed Earth-fixed car- 
tesian system. The origin of the coordinate system may be 
defined as the center of mass of the Earth. But the imperfect 
knowledge of the geocenter location limits the precise location 
of this origin. 

Figure 1 gives the definition of the World Geodetic System 
84 (WGS 84). The almanac and the ephemerides of GPS satel- 
lites are given in this coordinate system [6] . The coordinates 
of the ground stations derived by observing the GPS measure- 
ments will also be in the WGS 84 reference frame. But it 
should be noted that the absolute accuracy of any geocentric 
position determination depends upon the knowledge of the 
location of the geocenter relative to the assumed origin. The 
coordinate system thus defined is an ECEF coordinate system 
which rotates at a constant mean rate around a mean astro- 
nomic pole. Such a system is also called a conventional terres- 
trial system (CTS). However, events occur in an instantaneous 
real world, which is in a coordinate system different from the 
CTS. Therefore it is required to mathematically relate CTS to 
an instantaneous terrestrial system (ITS). This relationship is a 
transformation through a wobble [W] and a spin [S] : 

^its = I®] [W] ^cts (1) 

where the X’s are position vectors. The wobble [W] is given by 

[W] = R x (-y p )R y (x p ) (2) 
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where R,(/?) denotes a matrix of rotation, by an amount p 
about the r axis; x p and y p define the pole motion. The sign 
convention used is in accordance with the BIH convention. 
The spin is given by 

[S] = R z (-GAST) (3) 

where GAST is the Greenwich Apparent Sidereal Time given 
by 

GAST = GMST oh UT +w(r rf/ + UT1 -UTC) 

+ AT' cos e (4) 

GMST oh UT is the Greenwich Mean Sidereal Time at 0 hour 
UT, which is obtained from Newcomb’s equation adjusted 
with respect to J2000 [12] , l o is the mean rate of advance of 
the GMST per day, and is the day fraction in UTC of time 
of observation. The last term in Eq. (4) is commonly known as 
the equation of the equinoxes, where AT' refers to the nuta- 
tion in longitude and e is the true obliquity of the ecliptic of 
date. 

In general, celestial bodies are expressed in the Conven- 
tional Inertial System (CIS). Position vectors in this system 
can be transformed into ITS through a nutation [N] and a 
precession [P] [13] : 

X CIS = [V] [N] X ITS (5) 

The nutation [N] is given by 

[N] = R x (-e o ) R z (-A*) R x (e o + Ae) (6) 

where e Q is the mean obliquity of date; the nutation angles 
AT' and Ae are computed from IAU 1980 nutation series (12] 
expressed with respect to J2000. The precession [P] is given by 

[P] = R 2 (-f a )R^(-0 a )R(z a ) (7) 

where f a , Q a , and z a are the standard precession rotation angles 
[14]. Therefore, the position vectors in the reference frame 
WGS 84, which is one of the CTS, can be expressed with re- 
spect to the CIS using the above transformations. 

The SLR system has matured enough to establish its own 
independent coordinate system. The dynamic technique used 
to establish such a system depends heavily upon a precise 
definition of the coordinate frame adopted by the tracking 
network. This includes the definition of polar motion and the 
Earth’s fundamental constants, such as the gravitational con- 
stant (GM), the dynamical form factor (J 2 ), and the speed of 
light. Because satellite (LAGEOS, STARLETTE, etc.) position 


vectors are described in an inertial frame while ground station 
vectors are described in an ECEF frame, they need to be re- 
lated by the above coordinate transformations. Processing of 
SLR long-arc data has been successful in simultaneously solv- 
ing for station vectors, satellite orbits, and earth orientation 
parameters to precisions of few centimeters. 

III. Strategies to Determine the Origin 
Offset from the Geocenter 

For the past several years the fundamental concept behind 
accurate GPS orbital adjustment has been that of the fiducial 
network [7] . A fiducial network consists of three or more 
tracking stations whose (relative) positions have been deter- 
mined in an Earth-fixed coordinate frame to a very high 
accuracy, usually by VLBI. Several receivers at other, less 
accurately known, stations also observe simultaneously the 
GPS satellites along with the fiducial network. The data are 
then brought together to simultaneously adjust the GPS satel- 
lite orbits and the positions of the nonfiducial sites. Thus the 
fiducial stations established by VLBI provide a self-consistent 
Earth-fixed coordinate system with respect to which the 
improved GPS satellite orbits and the nonfiducial stations can 
be expressed to a greater accuracy. At the same time the coor- 
dinate frame origin offset from the geocenter can also be 
estimated using the same set of data. Experience in this area 
has indicated that an over-constrained network, where more 
baselines or sites than necessary are fixed, can in fact produce 
a degraded solution. This is because in an over-constrained net- 
work the a priori uncertainty in the fixed parameters that are 
more than necessary will result in a suboptimal filter weight- 
ing. The solution will then be highly influenced by the mis- 
modeling of these parameters. 

In the first strategy proposed, the fiducial baselines are 
treated in three different ways: 

(A) Fix two fiducial baselines. 

(B) Constrain two fiducial baselines by a priori weighting. 

(C) Fix only one fiducial baseline. 

The baselines define the orientation of the adopted coor- 
dinate frame. The absolute scaling can be fixed either by the 
length of these baselines or by the Earth’s gravitational con- 
stant, GM. Both are known to an accuracy of about one part 
in 10 8 . The baseline length is used to define the absolute 
scaling so that the resulting coordinate frame will be consis- 
tent with the VLBI frame defined by the fiducial baselines. 
For the case with two baselines fixed, it is rather convenient 
to select one of the fiducial stations common to both fixed 
baselines as the reference site. The filter process is so designed 
that the baselines between the reference site and all other 
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nonfiducial sites are adjusted along with the Earth Orientation 
Parameters (EOP), namely polar motion (x , y p ) and UT1- 
UTC rate, the GPS satellite orbits, and the absolute coordi- 
nates of the reference site, which in turn infer the adjustment 
of the geocenter position coordinates. The Earth’s GM is also 
adjusted, although the data strength may not be great enough 
to improve the value of GM appreciably. 

In the second strategy, the same GPS tracking network of 
globally distributed stations is used. However, only the longi- 
tude of a reference site is held fixed; all other site coordinates 
are adjusted simultaneously with the GPS orbits. Here, the GM 
of Earth provides the absolute scaling. The station heights can 
be derived from the adjusted periods of GPS orbits and pseu- 
dorange measurements. The time signature of the measure- 
ments defines the latitude. Figure 2 graphically demonstrates 
the time signature of the measurements for two hypothetical 
cases. The first graph shows the periodic signature generated 
by the pseudorange (p) measurements to an orbiting GPS 
from a stationary receiver. The period is equal to the GPS or- 
bit period, which is nearly 12 hours, and the amplitude is pro- 
portional to the geocentric position vector of the receiver 
projected onto the orbital plane. The second graph shows the 
case when a stationary GPS satellite is above the equator of a 
spinning Earth. The period is now 24 hours, and the ampli- 
tude is proportional to the cosine of the receiver latitude. The 
variation of the signature with respect to the receiver latitude 
is depicted in the sketch. Because of the difference in period, 
the effects due to the rotation of the receiver can be separated 
from the GPS orbiting signature and the latitude can unam- 
biguously be solved. 

A simple mathematical model can be written out for the 
estimate of geocenter offset. This offset is expressed as the 
weighted mean of the position offsets of all stations. The equa- 
tions corresponding to the geocenter offset AG are represented 
as 

AG^ + Ax.+ v. = 0, x~*y,z\ (8) 

i = 1,2, ... ,n 

where Ax f is the x component of the z th geocentric station 
position offset and is the error associated with Ax The cor- 
responding error covariance of the geocenter offset can be 
expressed as 

Var (AG) = [A T WA]-‘ (9) 

3X3 

where 

A t = [-I-I...-I] 

3 X 3n 


and W is a (3 n X 3 n) weight matrix which is the inverted co- 
variance matrix of the station position estimates. 


IV. Covariance Analysis 

A covariance analysis was carried out to assess the accuracy 
with which the geocenter offset from the origin of the adopted 
coordinate frame can be determined with each of the approaches 
proposed in previous sections. A full constellation of 18 GPS 
satellites distributed in six orbital planes was assumed. A data 
arc spanning over 34 hours from a network of six globally 
distributed tracking stations was also assumed. The three fidu- 
cial sites are the three NASA Deep Space Network (DSN) 
tracking sites (Fig. 3) at Goldstone, California; Canberra, 
Australia; and Madrid, Spain. The remaining sites in Japan, 
Brazil, and South Africa are nonfiducial sites. Simultaneous 
GPS P-code pseudorange and carrier phase measurements are 
made at all of these stations. The relative positions of the three 
DSN sites have been measured repeatedly by VLBI over many 
years and are known to an accuracy of about 3 cm. Goldstone 
was selected to be the reference site because of its common 
VLBI visibility with the other two DSN sites at Canberra and 
Madrid. P-code pseudorange and carrier phase data noise were 
assumed to be 5 cm and 0.5 cm, respectively, when integrated 
over 30 minutes and corrected for ionospheric effects by dual- 
frequency combination. 

Carrier phase biases were adjusted with a large a priori 
uncertainty. Table 1 lists the error sources assumed for the 
first strategy. The robustness of the GPS measurements allows 
all the GPS and station clocks to be treated as white-noise 
processes and adjusted [3] , [4] to remove their effects on the 
solutions. Also adjusted are the zenith tropospheric delays at 
all ground sites, which were treated as random-walk param- 
eters to model the temporal change. Such models have been 
proved to be effective in removing their errors without ser- 
iously depleting the data strength [9] . 

The GPS covariance and simulation analysis software sys- 
tem, OASIS [15], recently developed at JPL, was used to 
carry out the study. In OASIS, partial derivatives with respect 
to cartesian components of site locations and the geocenter 
are readily produced. It is shown in the Appendix that base- 
line partials are related to site location partials as follows. 

(1) The partial derivative with respect to a cartesian com- 
ponent of the reference site is the sum of all partial 
derivatives with respect to the same component of all 
sites forming the baselines. Note that this is also the 
partial derivative with respect to the same component 
of the geocenter position. 
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(2) The partial derivative with respect to a baseline carte- 
sian component is the same as the partial derivative 
with respect to the same component of the nonrefer- 
ence site forming the baseline. 

Hence, the site location coordinate partials can readily be used 
in place of the baseline coordinate partials, and the geocenter 
offset coordinate partials in place of the reference site abso- 
lute coordinate partials. 

The second strategy assumes the same network of six track- 
ing sites. The estimated quantities are the coordinates of all 
six sites except the longitude of the reference site (Goldstone), 
together with the GPS satellite states, white-noise clocks, 
random-walk troposphere parameters, and carrier phase biases. 
Because the longitude of Goldstone is held fixed, the position 
components need to be given in a geodetic coordinate system, 
viz., longitude, latitude, and height. Table 2 lists the assump- 
tion variations that apply to this strategy. Other assumptions 
are kept the same as in Table 1. With this strategy, the error 
covariance matrix of geocenter offset is given by Eq. (9) in the 
previous section. 

V. Results of Covariance Analysis 

In the covariance analyses for both strategies, data arcs of 
various lengths were used to study the solution convergence. 
In all cases the station at Goldstone was considered to be the 
reference site, although in the second strategy any of the 
ground sites can be a reference site where the only fixed com- 
ponent is the longitude. 

Table 3 indicates the a priori error associated with the fidu- 
cial baselines, Goldstone-Canberra and Goldstone-Madrid, in 
all three cases of Strategy 1. The value of GM was adjusted, 
although it was found that the data strength of the GPS mea- 
surements is not great enough to improve on its a priori value. 
It should be noted that adjusting Earth’s GM makes GPS 
satellite states consistent with the absolute scaling as implied 
by the baselines. 

Figure 4 shows the total error of the origin offset as the 
length of the data span increases from 6 hours to 34 hours for 
Case A. At the end of 34 hours the origin offset error is 4.0 cm 
(rms of all three components). The bar chart shows a rapid 
reduction of error in origin offset between 6 and 12 hours. 
The result continues to improve after 12 hours but not at a 
very high rate. The reason for this can be seen in Fig. 5. After 
12 hours the origin offset error has come down to the level of 
baseline error; data gathered thereafter only gradually reduces 
the effects of data noise. At the end of 34 hours the effect of 
data noise is reduced to 3.4 cm and would continue to reduce 
as the arc length increases. The contribution of the baseline 


error, however, dropped to about 2.5 cm after 12 hours and 
remained virtually unchanged thereafter. This indicates that 
the geocenter can be determined only up to the a priori 
accuracy of the fiducial baselines. Therefore, with this strategy, 
any improvement on the baseline accuracy can improve the 
accuracy of the origin offset from the geocenter. For instance, 
it is customary to find baselines reported with a higher accu- 
racy in length than in the other two components. When a 
smaller error of 1 cm is assumed for the fiducial baseline 
length, along with 3 cm for the transverse and vertical com- 
ponents, the rms error on the origin offset from the geocen- 
ter reduces to 3.5 cm with a 34-hour arc of GPS measure- 
ments. Figure 6 shows the origin offset error for Case B, where 
the baseline vectors constrained to their a priori error are also 
estimated. The geocenter offset error after 34 hours reduces to 
3.8 cm. Note that the error involved here is mainly due to data 
noise alone. Results from Case C, where the Goldstone-Madrid 
baseline is the only baseline fixed, are plotted in Figs. 7 and 8. 
The geocenter offset error after 34 hours is 4.4 cm, which is 
slightly worse than the previous cases. In Fig. 8, however, the 
effect due to the fixed baseline reduces to 2 cm after 12 hours 
and settles at 1.7 cm after 18 hours. The effect due to the 
data noise will continue to decrease for longer data arc, but 
the baseline effect will remain unchanged, as shown by Figs. 5 
and 8. When the EOP are not estimated, the geocenter offset 
error after 34 hours is found to be 4.1 cm. This slight improve- 
ment is due to reduced data noise effect when fewer param- 
eters are estimated. 

In the second strategy no tracking site coordinates, except 
the longitude of the site at Goldstone, were held fixed. Here, 
as before, simultaneous adjustment of all GPS satellite states, 
tracking site coordinates, carrier phase biases, and zenith 
tropospheric corrections were carried out for various arc 
lengths ranging between 6 and 34 hours. Figure 9 plots the 
variation of the rms error of the origin offset from the geo- 
center with respect to the data arc length. The errors affect- 
ing the origin offset from the geocenter in this strategy are the 
data noise and the GM of Earth, which defines the absolute 
scaling. At the end of 6 hours the rms error of the origin off- 
set is 143.7 cm, which reduces to 8 cm at the end of 12 hours. 
This indicates that the control on the absolute scaling and the 
orientation in latitude is greatly improved after all the GPS 
satellites have been tracked by the globally distributed sites 
for a complete orbit cycle. At the end of 34 hours the rms 
error reduces to 2.1 cm. The graph shows a strong trend of 
decreasing rms error as the arc length increases. This indicates 
that the origin offset determination is limited only by the data 
noise. This result can be compared with Case C of Strategy 1 
when EOP are not estimated; there is about a 50% improve- 
ment in the geocenter offset error with this method. The 
Earth’s GM is known accurately enough so that its effect is 
on the order of 0.2 cm after 12 hours and is 0.1 cm at the end 
of 34 hours. 
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In the analysis of Strategy 2, the effects of polar motion 
and UT1-UTC have not been included. However, GPS mea- 
surements are insensitive to any constant UTl-UTC bias error. 
The analyses done with different cases of Strategy 1 have indi- 
cated that a constant bias for polar motion and a UT1-UTC 
rate can be included in the filter as additional adjusted param- 
eters without significantly degrading the performance. 

VI. Effect of Coordinate Frame Origin Offset 
on Orbit Determination of Low Earth- 
Orbiting Satellite 

To gain further insight into the significance of accurate 
definition of geocenter, the effect on the radial position of a 
low Earth-orbiting satellite, in particular TOPEX, was studied. 
The error assumptions used are the same as given in Table 1 
except for those parameters listed in Table 4. The result pre- 
sented by Case C of Strategy 1 shows that the origin offset 
accuracy is 4.4 cm (Fig. 7) with only one baseline fixed and a 
data arc of 34 hours. This value is the most pessimistic of all 
the results presented. Here, the origin offset was assumed to 
have an error of 4 cm in each component and left unadjusted. 
A reduced dynamic tracking technique [16] was implemented 
in the study where a fictitious 3-D force on TOPEX was 
adjusted as process noise with constrained a priori uncertainty. 
Figure 10 plots the error in the radial component of TOPEX 
caused by various sources. The total error in TOPEX altitude 
over the 2-hour arc has an rms value of 9.7 cm. Figure 11 
shows the altitude error variation with time, along with the 
part contributed by a 4-cm geocenter uncertainty, over the 
2-hour arc. Without the refinement with GPS measurements, 
the geocenter position uncertainty would be greater than 
10 cm, and the TOPEX altitude determination error would 
be greater than 14 cm. 

VII. Summary and Conclusions 

A geocentric coordinate frame provides a practical global 
reference system with a physically meaningful and unambigu- 


ous definition of the coordinate origin. Two basic strategies 
for establishing a geocentric coordinate frame for GPS mea- 
surements have been investigated. All three cases of the first 
strategy make use of the precise relative positions which have 
been predetermined by VLBI to fix the frame orientation and 
the absolute scaling, while the offset from the geocenter is 
determined from GPS measurements. The reference frame thus 
adopted is consistent with the VLBI coordinate system. The 
second strategy establishes a reference frame by holding only 
the longitude of one of the tracking sites fixed. The absolute 
scaling is inferred by the adopted gravitational constant (GM) 
of Earth; the orientation in latitude is inferred from the time 
signature of Earth rotation in the GPS measurements. The 
coordinate system thus defined is a geocentric Earth-fixed 
coordinate system. The covariance analysis has shown that 
geocentric positioning to an accuracy of a few centimeters can 
be achieved with just a one-day arc of precise GPS pseudo- 
range and carrier phase data. 

Each of the two strategies has its advantages in different 
applications. The first strategy should be adopted in applica- 
tions requiring a coordinate frame consistent with the VLBI 
reference frame. Among these applications are the monitoring 
of crustal motions in areas which have been investigated by 
VLBI observations and the determination of the Earth rota- 
tion parameters, viz., polar motion and variation of UT1-UTC. 
The second strategy, which holds the longitude at a reference 
site fixed, strictly limits itself in an ECEF frame established 
by the adopted values for the fixed longitude and the GM of 
Earth, and by GPS measurements. This method provides a 
superior result as long as the precise applications are within 
the same ECEF frame. Applications in which such an ECEF 
coordinate frame can be adopted include datum definition and 
network densification in an area where ECEF coordinates are 
appropriate. Various topographic and oceanographic surveys 
and prospecting surveys can benefit from its simplicity. In 
TOPEX orbit determination, this method can also be very 
convenient if a CTS frame such as WGS 84 is adopted. 
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Table 2. Variations of assumptions from Table 1 for Strategy 2 
(fixing only one longitude) 


Table 1. Error sources and other assumptions for Strategy 1 
(fixing baselines) 


Reference site: 

Goldstone 

Other fiducial sites: 

Canberra, Madrid 

Nonfiducial sites: 

Brazil, Japan, South Africa 

GPS constellation: 

18 satellites in 6 orbital planes 

Cutoff elevation: 

10 degrees 

Data type: 

P-code pseudorange; carrier phase 

Data span: 

6-34 hours 

Data interval: 

30 minutes 

Data noise: 

5 cm-pseudorange;0.5 cm-carrier 
phase 

Carrier phase bias: 

10 km (adjusted) 

Clock bias: 

3 jusec-white noise (adjusted) 

GPS epoch state: 

10 m; 1 mm/ sec (adjusted) 

Geocenter position: 

10 m each component (adjusted) 

Baseline coordinates: 

3 cm each component-fiducial; 

10 cm each component-nonfiducial 
(adjusted) 

Zenith troposphere: 

Random walk parameter (adjusted): 
20 cm bias; 1 .3 cm batch to batch 

Earth’s GM: 

One part in 10® 

Solar pressure: 

10% 

(UT1-UTC) rate: 

10 m/day (adjusted) 

Polar motion (x py y p ): 

10 m (adjusted) 


Reference site: 

Goldstone 

Reference site coordinates: 

10 m (latitude) 

(adjusted) 

0 m (longitude) 


10 m (height) 

Other site coordinates: 

10 m each component 

(adjusted) 



Table 3. Fiducial baselines in Strategy 1 


Case 

Baselines 

Adjusted 

a priori 

A 

Goldstone-Canberra 

no 

3 cm 


Goldstone-Madrid 

no 

3 cm 

B 

Goldstone-Canberra 

yes 

3 cm 


Goldstone-Madrid 

yes 

3 cm 

C 

Goldstone-Canberra 

yes 

10 cm 


Goldstone-Madrid 

no 

3 cm 


Table 4. Variations of assumptions from Table 1 for TOPEX 
orbit determination 


Data span: 

2 hours 

Data interval: 

5 minutes 

TOPEX epoch state: 

1 km; 1 m/sec (adjusted) 

3-D force on TOPEX: 

Process-noise (adjusted): 
0.50 jum/s 2 bias; 

0.35 mi n/s 2 batch to batch 

Gravity: 

50% of current uncertainty 
(20 X 20 lumped) 

Geocenter: 

4 cm each component 
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ORIGIN; EARTH'S CENTER OF MASS 

Z AXIS: PARALLEL TO DIRECTION OF THE CONVENTIONAL TERRESTRIAL 
POLE (CTP) FOR POLAR MOTION AS DEFINED BY THE BIH ON THE 
BASIS OF THE COORDINATES ADOPTED FOR THE BIH STATIONS 

X AXIS; INTERSECTION OF THE ZERO MERIDIAN PLANE DEFINED BY BIH 
(WGS 84 REFERENCE MERIDIAN PLANE) AND THE PLANE OF CTP 
EQUATOR 

Y AXIS: MEASURED IN THE PLANE OF CTP EQUATOR, tt/2 EAST OF X AXIS, 
THUS COMPLETING THE RIGHT-HANDED EARTH-CENTERED, 
EARTH-FIXED (ECEF) ORTHOGONAL COORDINATE SYSTEM 


Fig. 1. The WGS 84 coordinate system. 



STATIONARY GPS AT EQUATOR; 



hr 


hr 



Fig. 2. Time signature of GPS measurements. 
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Fig. 3. A global GPS tracking network. 
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Fig. 4. Convergence of geocenter offset determination using 
Case A of Strategy 1 (two baselines fixed). 
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Fig. 5. Breakdown of geocenter offset determination error using 
Case A of Strategy 1 (two baselines fixed). 
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Fig. 6. Convergence of geocenter offset determination using 
Case B of Strategy 1 (two constrained baselines). 



6 12 18 24 30 34 


DATA ARC LENGTH, hr 

Fig. 8. Breakdown of geocenter offset determination error for 
Case C of Strategy 1 (one baseline fixed). 



Fig. 7. Convergence of geocenter offset determination using 
Case C of Strategy 1 (one baseline fixed). 



Fig. 9. Convergence of geocenter offset determination using 
Strategy 2 (longitude at Goldstone fixed). 
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Appendix 

Measurement Partial Derivatives with Respect to Baseline Components 


Let the cartesian coordinates of the set of N tracking sites 
be (*j, y l9 zj, (x 2 ,y 2 , z 2 ), . . . , (x N ,y N ,z N ). We can form 
the following baseline components: 

b x,i = x i~ x 1 ’ 

' (A-l) 

/ = 2,3, ... ,N 

where site 1 has been selected as the reference site with which 
all baselines are formed. For completeness, we also define 


where 5 z - ; - is the Kronecker delta. The partial derivative of a 
measurement R with respect to the baseline components bj 
can be expressed in terms of those with respect to the site 
coordinates by the following chain rule: 

dR = dR dx i x 3 R dx 2 x x 3 R dx N 

db dx t db 3* 3 b . ’ * * 3x A , ’ 

x,j 1 x,f 2 xj N x,j 

(A-5) 


b x ,x =x v x ^y> z 


(A-2) which, with the substitution of Eq. (A-4), becomes 


for the reference site. For simplicity, but without loss of 
generality, partial derivatives with respect to only the x-com- 
ponent of baselines will be derived. The relation for the other 
two components follows directly. These equations can be 
rearranged as 


b r 


x i = 


/ = i 

i =2,3, ... ,N 


(A-3) 


b i + b V 

from which the following partial derivative can be formed: 


dx . 


1, 


3 b 


xJ ( *,/» 


/ = 1 

/ = 2,3, ... ,7V 


(A-4) 



/ = 1 

/ = 2,3, ... ,7V 

(A-6) 


Hence, the partial derivative of the measurement with respect 
to a cartesian component of a baseline is the same as that with 
respect to the same component of the nonreference site form- 
ing the baseline; and the partial derivative with respect to a 
component of the reference site is the sum of all partial deriva- 
tives with respect to the same component of all sites forming 
the baselines. 
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Orbits for satellites of the Global Positioning System ( GPS) have been determined with 
submeter accuracy . Tests used to assess orbit accuracy include orbit comparisons from 
independent data sets , orbit prediction , ground baseline determination , and formal errors. 
One satellite tracked for 8 hours each day shows rms errors below 1 m even when predicted 
more than 3 days outside of a 1-week data arc. Differential tracking of the GPS satellites 
in high Earth orbit provides a powerful relative positioning capability , even when a rela- 
tively small continental U. S. fiducial tracking network is used with less than one-third of 
the full GPS constellation. To demonstrate this capability , baselines of up to 2000 km in 
North America were also determined with the GPS orbits. The 2000-km baselines show 
rms daily repeatability of 0.3 to 2 parts in 10 8 and agree with very -long-baseline inter- 
ferometry (VLBI) solutions at the level of 1.5 parts in 10 8 . This GPS demonstration pro- 
vides an opportunity to test different techniques for high-accuracy orbit determination 
for high Earth orbiters. The best GPS orbit strategies included data arcs of at least 1 week , 
process noise models for tropospheric fluctuations , estimation of GPS solar pressure co- 
efficients , and combined processing of GPS carrier phase and pseudorange data. For data 
arcs of 2 weeks , constrained process noise models for GPS dynamic parameters signifi- 
cantly improved the solutions. 


I. Introduction 

The Global Positioning System (GPS), expected to be fully 
operational by the early 1990s, will consist of 24 satellites 
evenly spaced in six orbit planes at an altitude of about 
20,000 km. Knowledge of GPS orbits will provide the basis 
for highly accurate ground and satellite user positioning. A 
wide variety of users will benefit from this positioning capa- 
bility. GPS will be used at NASA’s Deep Space Network (DSN) 
stations in conjunction with very-long-baseline interferometry 
(VLBI) radiotelescopes for atmospheric calibrations, precise 


ground station position determination, monitoring of Earth 
orientation changes on time scales of less than one day [1] , 
and possibly time synchronization at the nanosecond level. 
Differential GPS-based accuracies for high Earth orbiters are 
expected at the several-meter level for altitudes of 5,000 to 
40,000 km [2] . Spacecraft maneuvering near and docking 
with the Space Station will carry GPS receivers and will use 
GPS signals for real-time and near-real-time navigation and 
guidance. Low Earth orbiting spacecraft such as TOPEX/ 
Poseidon [3] and the Earth Observing System platforms will 
have orbit determination available in post-real time to an 
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accuracy of better than 10 cm with kinematic smoothing 
techniques using advanced receivers to track GPS satellites 
simultaneously with a worldwide GPS ground tracking net- 
work. The relatively low cost and convenience of GPS ground 
receivers have created many new opportunities to monitor 
cm-level crustal motions in geologically active regions. Very 
dense ground networks may achieve accuracies equaling or 
surpassing those available from other generally more restric- 
tive techniques such as VLBI or satellite laser ranging. 

The GPS applications with the most stringent requirements 
include the DSN applications, subdecimeter low Earth orbit 
determination, cm-level measurements of Earth crustal motion, 
and cm-level monitoring of changes in Earth orientation. To 
reach these goals, GPS orbits will have to be determined to 
better than 50-cm accuracy. The Jet Propulsion Laboratory 
has been developing and testing GPS orbit estimation software 
and techniques for several years with the goal of demonstrat- 
ing the capability for high-accuracy orbit determination. GPS 
data from several field experiments in 1985 and 1986 have 
been used to determine precise GPS orbits. Although only 
seven developmental GPS satellites were operational and the 
ground fiducial tracking network was limited to sites in the 
continental United States, covariance studies indicated that 
with currently available GPS receivers and antennas it should 
be possible to produce orbits for well-tracked GPS satellites 
accurate to 1 m. Achieving this 1-m accuracy capability is a 
major milestone on the road to ultrahigh-precision GPS 
applications. 

In this article we present results demonstrating submeter 
accuracy for the GPS orbits determined from these field tests. 
Ground station coordinates were estimated simultaneously 
along with the orbit parameters. Accuracy of better than 3 cm 
has been achieved over baselines up to 2000 km, proving that 
GPS is already a very powerful technique for precise position- 
ing over continental distances. 

II. Data Acquisition and Processing 

A series of GPS field experiments took place March and 
November 1985, June 1986, and January 1988. These experi- 
ments were organized by JPL and were cooperative ventures 
with several different organizations participating. In the 
March 1985 experiment, data were collected for about 1 week 
at ground sites in the continental United States only. In 
November 1985, the tracking network also included three 
sites in Mexico, and the experiment lasted for about 2 weeks. 
The June 1986 experiment covered a 3-week period and in- 
cluded sites in the Caribbean region as well as a dense network 
of stations in Southern California. The data from the experi- 
ments up through 1986 has been processed, with the January 
1988 data expected to be distributed shortly. This article re- 


ports analysis based on the data collected during the November 
1985 and June 1986 experiments. Results from the March 

1985 experiment have been reported earlier [4] , [5] . 

A. The November 1985 and June 1986 Data Sets 

The November 1985 GPS experiment took place from 
November 12 through November 24. The June 1986 experi- 
ment spanned about 3 weeks; the results presented in this 
article are based on the first part of the experiment from June 2 
to June 10. Figure 1 shows the locations of the ground track- 
ing sites, which represent a subset of the total number that 
participated in the experiments. TI 4100 GPS receivers [6] 
were operated at most of the sites. SERIES-X(7) receivers 1 
built at JPL were used at Mojave and at Owens Valley Radio 
Observatory (OVRO). In November 1985, water vapor radiom- 
eters (WVRs) were available for wet tropospheric delay cali- 
brations at OVRO, at the Mexican sites, and at Mojave for part 
of the experiment. In June 1986, WVRs were used at Haystack, 
Mojave, and some of the Caribbean sites. Dry tropospheric 
delay calibrations were computed from surface measurements 
of barometric pressure. For receivers located at sites where 
WVRs were not available, wet tropospheric delay corrections 
were computed from surface meteorology data. Single-day-arc 
baseline and GPS orbit solutions were generated for examina- 
tion of residuals and to check the quality of the data. Novem- 
ber 12 and 17 were excluded because of data outages and other 
difficulties at some of the fiducial sites. For multiday-arc solu- 
tions, the November 1985 data set, covering 12 days, was 
divided first into two arcs of 7 and 5 days (November 13-19 
and November 20-24). This was necessary due to a maneuver 
of more than 100 km which took place on November 20 during 
which Navstar 4 was moved to a new orbit. Eventually, we 
modeled and solved for the maneuver as described below, and 
a single long data arc covering November 13-24 was constructed. 
During these GPS experiments, periods of common ground 
visibility lasted about 6 to 8 hours. Most satellites were visible 
continuously from a given ground station for only about 3 hours, 
so several times during the tracking period the receivers switched 
to a new combination. Navstar 8 was unusual in that it was 
visible for up to 8 hours from most of the ground sites. 

Because of the short tracking periods (relative to the GPS 
orbital period of 12 hours) from a limited network of ground 
sites, orbits determined from single-day passes in the 1985 and 

1986 field tests were significantly weaker than those deter- 
mined from multiday arcs. The additional strength from the 
multiday arcs derives mainly from the visibility of the satel- 


1 R. B. Crow, F. R. Bletzacker, R. J. Najarian,G. H. Purcell, J. I. Statman, 
and J. B. Thomas, “Series-X Final Engineering Report,” JPL Inter- 
nal Document D-1476, Jet Propulsion Laboratory, Pasadena, Cali- 
fornia, 1984. 
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lites from the ground during multiple orbit revolutions. With 
observations over more than one revolution, the orbital periods 
are more accurately determined and therefore the positions of 
the orbital nodes are more precise. The down-track satellite 
components benefit especially from the multiday arcs, as is 
sometimes manifested by a corresponding improvement in 
the eastern component baseline accuracy [5] . A second advan- 
tage of multiday-arc solutions is the \fn improvement in pre- 
cision, where n is the number of measurements, which can 
apply to any orbital or baseline component. As more satellites 
are launched and the tracking network expands geographi- 
cally, shorter arcs will achieve the same level of orbit accuracy. 

B. Data Collection and Processing 

Both GPS carrier phase and pseudorange data were received 
at all the sites equipped with TI receivers. Carrier phase only 
was used from the Series-X receivers, which are codeless. The 
carrier signals at LI and L2 bands (1.227 and 1.575 GHz) are 
modulated by a pseudorandom noise code called the P code, 
which operates at 10.23 MHz. Continuously tracked GPS 
carrier phase provides a very precise measure of range change , 
while the P code provides a measure of absolute range . The 
pseudorange is considerably noisier than the carrier phase data 
type, and in this experiment the pseudorange was corrupted 
by errors due to multipath signals. The GPS observables at the 
two L-band frequencies are linearly combined to remove the 
portion of ionospheric delay which varies as the inverse square 
of the frequency. For more details about the characteristics 
of the GPS signals, see [7] and [8] . Hereafter, the terms 
“carrier phase” and “pseudorange” refer to these linear com- 
binations of LI, L2 and Pi, P2. The GPS data were processed 
with the GPS Inferred Positioning SYstem (GIPSY) orbit 
determination and baseline estimation software, which was 
completed and tested at JPL shortly after the data were 
collected for the 1985 experiments. 

III. Orbit Determination Approach 

The JPL orbit determination software utilizes a pseudoepoch- 
state U-D factorized Kalman filter 2 [9] . The filter works as a 
batch sequential program with the option to model parameters 
as first-order exponentially correlated process noise, also com- 
monly called colored noise. The GIPSY software uses the 
J2000 reference system with observation partials for param- 
eters computed relative to the satellite epoch states [10] . 

For the November 1985 data set, the nominal GPS ephem- 
erides were obtained by using broadcast orbits as initial values 


2 S. C. Wu, W. I. Bertiger, J. S. Border, S. M. Lichten, R. F. Sunseri, 
B. G. Williams, P. J. Wolff, and J. T. Wu, “OASIS Mathematical 
Description, v. 1.0,” JPL Internal Document D-3139, Jet Propulsion 
Laboratory, Pasadena, California, 1986. 


and iterating to improve those orbits (and remove large unde- 
sired offsets resulting from different coordinate frame conven- 
tions) with a small subset of the data. During the iteration, 
GPS solar radiation pressure coefficients were also determined 
and these coefficients and the new orbits were used as nominal 
values for the more comprehensive precise orbit filter runs. 
In June 1986, the hybrid postfit ephemeris from the Naval 
Surface Weapons Center (NSWC) was used for nominal trajec- 
tories and a similar iteration was performed prior to the final 
filter solutions. 

The high-precision orbit determination strategy used with 
the November 1985 and June 1986 GPS data was based on 
previous experience with the March 1985 GPS experiment 
[5] . A key aspect of the orbit estimation process is the fidu- 
cial concept, where three or four receivers with well-known 
coordinates in a consistent reference frame are held fixed 
while all other parameters, including orbital states and coor- 
dinates of the nonfiducial sites, are estimated simultaneously 
in the filter. Reference 5 and the references therein discuss the 
fiducial concept as well as alternative approaches. The fiducial 
receivers for the experiments discussed in this article were 
collocated at VLBI sites. Haystack, Richmond, and Fort Davis 
were generally treated as fiducials. In some runs OVRO was 
held fixed, and in others Hat Creek was used as a fiducial so 
that one of the normally fixed receivers could be estimated 
using the GPS data in order to test the internal consistency. As 
more GPS satellites are launched and the ground tracking net- 
work is expanded from North America to include stations on 
other continents, we expect that fewer fiducial constraints 
will need to be applied and more station location parameters 
will be determined from the GPS data. 

A. Clock and Bias Parameters 

The results presented here are based on estimation of sta- 
tion and GPS clocks as white process noise. At each measure- 
ment epoch, each active clock is assumed to have a value un- 
correlated to its value at other epochs. Although some station 
clocks were running off hydrogen masers and most of the GPS 
clocks have well-characterized behavior typical of rubidium 
and cesium atomic standards, this extra information was not 
used. The white noise clocks were estimated simultaneously 
with the other parameters in the filter. When the process noise 
model for clocks is white noise, the results are virtually the 
same as would be achieved with double differencing [11] for 
clock elimination. 

The GPS carrier phase, when continuously tracked, pro- 
vides a very precise measure of range change from measure- 
ment epoch to measurement epoch. The absolute phase, how- 
ever, and hence the absolute range from transmitter to receiver, 
is ambiguous by an integral number of wavelengths. For each 
station-satellite tracking pass, a carrier phase bias parameter is 
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estimated, along with the other adjusted parameters, ignoring 
the integer constraint on its value. Over a period of hours, the 
signature of the range change precisely measured with the car- 
rier phase enables’the orbit to be determined. The pseudorange, 
on the other hand, provides a more direct range determina- 
tion but is much noisier and more susceptible to multipath 
errors than the carrier phase data type. The ultimate accuracy 
would be reached if carrier range were available. Carrier range 
is a range determined from carrier phase with the bias ambi- 
guities all resolved; it has the best features of both carrier 
phase (low noise, low multipath) and pseudorange (absolute 
range measure and geometric strength). Successful carrier 
phase ambiguity resolution has been reported over baselines of 
up to 2000 km with bias fixing or bias optimizing techniques 
applied to single-day arcs [12], [13]. The results reported 
here were achieved by estimation of the white noise clock and 
carrier phase bias parameters with a very large a priori uncer- 
tainty, so that their solutions were basically unconstrained. Bias 
fixing was not used to resolve the carrier phase ambiguities. 

B. Tropospheric Delay Fluctuations 

The troposphere was modeled as a spherical shell which 
adds a delay along the GPS signal path: 

o *»..*.(•) 0) 

where p z is the zenith tropospheric path delay and R is an ana- 
lytic mapping function [14] to map zenith delays to line-of- 
sight path delays at elevation 0. There are two components to 
the tropospheric delay-the wet and the dry, denoted here 
with subscripts w and d. The dry component can be deter- 
mined under the assumption of hydrostatic equilibrium using 
the ideal gas law for dry air to better than 1 cm [15] . The wet 
delay component, although considerably smaller than the dry, 
exhibits greater time and spatial variation and is much more 
difficult to determine accurately. 

WVRs were operated at some of the GPS tracking sites 
alongside the GPS receivers. WVR calibrations are believed 
accurate to 2 cm or better for determination of the wet zenith 
path delay [15] . The algorithm described in [16] was used to 
determine these calibrations. At the other sites, surface meteo- 
rology (SM) measurements (temperature, air pressure, and rela- 
tive humidity) were used for the wet zenith delay calibration 
[17] . The SM calibration is much less reliable than the WVR 
calibration because the surface meteorological conditions are 
not always well correlated with the total atmospheric water 
vapor content [15] . 

Wet zenith delay corrections to the calibrations (WVR or 
SM) were estimated with the GPS data for all GPS tracking 
sites. For most sites with WVRs, a constant wet zenith delay 


parameter with an a priori constraint of 3 cm was estimated 
for each 8-hour tracking day. For sites using SM calibrations, 
the wet zenith delay was estimated daily with a 20-cm a priori 
constraint. In addition, stochastic residual delays were esti- 
mated for SM sites in order to remove signatures which could 
result from temporal variations in the troposphere, time- 
varying errors in the SM calibrations, errors in the mapping 
function, or spatial inhomogeneities due to azimuthal asym- 
metry in the water vapor content. In some cases, tightly con- 
strained process noise troposphere residual delay parameters 
were also estimated for the WVR sites. 

C. Troposphere Process Noise Models 

The stochastic model in the GIPSY filter is for a first-order 
exponentially correlated process noise [9] . The measurements 
are processed in discrete time segments, known as batches. In 
each batch, process noise parameters are modeled as piecewise 
constants. At the end of a batch, a process noise time update 
adds noise to the covariance matrix and thus causes the time- 
varying behavior of the stochastic parameters. The process 
noise time update for the ;th batch maps the estimates and 
covariance for the stochastic parameters into batch / + 1 : 

P /+ , = M/P/ + w y (2) 

where p y is a vector of estimates for the stochastic parameters 
and M is a diagonal process noise mapping matrix. The process 
noise w ; - is a random process with zero mean and 

E ( w / w k) = Q V 0) 

where Q is the covariance matrix diagonal and 8 ; - k is the 
Kronecker delta function [9] . The diagonal entries of M are 
given by 

m ij = ex p [-(' /+ i -9 lT n ] (4) 

where tj is the start time for the jth batch and r i j is the time 
constant for the ith stochastic parameter at the/th batch. The 
corresponding diagonal entry in the matrix Q is 



where a iss , the steady-state sigma for the ith stochastic param- 
eter, is the noise level that would be reached if the system 
were left undisturbed for a time much greater than r. The 
process noise model for each parameter is fully specified by 
o ss and r, which can also vary with time, although the sub- 
script j has been left off o ss for simplicity of notation. There 
are two special limiting cases: white process noise, and a ran- 
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dom walk. For white process noise, r = 0, m = 0, and, as can 
be seen in Eq. (2), the a priori covariance for the process noise 
parameters, p, is completely reset at the end of each batch, 
including zeroing of off-diagonal terms and inserting q for the 
variance on the diagonal. For white noise, the process at each 
time step is independent and uncorrelated with the process at 
other time steps. The opposite case is the random walk. Here 
both o ss and r are unbounded, since a steady state is never 
reached and r = °°. For the random walk, however, q is still 
defined by Eq. (2), where M is now equal to the identity 
matrix. The Allan variance [18] , a 2 (Af), which is often used 
to characterize clock and atmospheric fluctuations [19], is 
directly related to the random walk q: 

o 2 a (A/) = q/At 2 (random walk) (6) 

A wide range of process noise models has been tested on 
the March and November 1985 and June 1986 GPS data sets 
[5] , [15] , [20] . The random walk zenith tropospheric delay 
models with in the range 2 to 4 X 10~ 7 km-s” 1 / 2 (for SM 
sites) produced the best daily baseline repeatability, agreement 
with VLBI, and orbit repeatability. When only constant zenith 
delay parameters were estimated, orbit and baseline accuracies 
were worse by about a factor of 2. The value of \/q adopted 
for most of the November 1985 and June 1986 analyses was 
2 X 10 -7 km*s~ 1 /' 2 , corresponding to about 6 cm variation 
over a 24-hr period. There is evidence from VLBI residuals 
[19], [21] that tropospheric delay and delay-rate fluctuations 
can be well modeled as random walks for At greater than a few 
hundred seconds. Since the GPS data were compressed to 
300-sec intervals, the use of random walk tropospheric fluc- 
tuation models for GPS is consistent with the VLBI findings. 
As discussed in [5] , however, the GPS process noise param- 
eters were used to estimate a fluctuating residual correction to 
calibrated data (WVR or SM), and it is not clear that this 
quantity will have the same stochastic characteristics as the 
tropospheric fluctuations themselves. It was assumed that the 
spectral characteristics of both the tropospheric fluctuations 
and the residuals after calibration would be similar. 

D. Solar Radiation Pressure and Other 
Nongravitational Forces 

Multiday arcs covering 1 to 2 weeks were used to achieve 
the highest GPS orbit accuracy. With a global tracking system 
equipped with high-performance GPS receivers and a full 
24-satellite GPS constellation, covariance analysis predicts 
GPS orbit accuracy well below 1 m after just several hours of 
tracking [22]. However, as can be seen from Fig. 1, the 
ground tracking network during 1985 and 1986 was limited, 
with fiducials in North America only. The seven GPS satellites 
that were operating at that time by design tend to converge 


over the southwest United States. This further limited com- 
mon ground visibility to less than 8 hours per day and reduced 
the geometrical strength of the system. In addition, the pseu- 
dorange available from the TI receivers that were used in 
1985 and 1986 was highly contaminated by ground multipath, 
thereby raising the effective measurement noise. Because of 
these factors, multiday arcs were necessary to achieve the 
desired improvement in ephemeris accuracy. 

With single-day (8-hour) orbit solutions, there is very little 
sensitivity to errors in the GPS solar radiation pressure coeffi- 
cients [4] . However, for multiday arcs with multiple orbit 
revolutions, the orbital period is much more accurately deter- 
mined and therefore the results become sensitive to down- 
track errors resulting from integration of accelerations due to 
solar radiation forces acting on the GPS satellites, mismodeled 
solar radiation, or unmodeled forces such as thermal radiation 
from the spacecraft body. The GPS Block I ROCK4 model was 
used to represent accelerations resulting from solar radiation 
pressure. As described in [5] and references therein, ROCK4 
models 13 surfaces on the satellites according to their size, 
curvature, reflectivity, specularity, and absorption character- 
istics. The model as implemented in GIPSY allows for adjust- 
ment of three parameters: G x , G yy and G z . G x and G z are 
scaling factors in the local spacecraft jc and z directions, where 
the z axis is positive along the antenna toward the center of 
the Earth, the y axis is along the solar panel support beam 
normal to the spacecraft-Sun-Earth plane, and the x axis com- 
pletes a right-handed coordinate system. G y represents a con- 
stant acceleration in the jy-axis direction, often referred to as 
the y-bias parameter [23] . 

In principle, the G x and G z parameters should have the 
same value if the spacecraft were perfectly aligned and the 
model were correct. GPS orbits have been determined in the 
past with estimation of only G y and one parameter (desig- 
nated here as G xz ) to represent both G x and G z [24] , [25] . 
For the multiday arcs determined with the March 1985 data 
[5], G x , G y , and G z were estimated independently as con- 
stants for each satellite with the intention of adding an extra 
degree of freedom ( G x ^ G z ) to absorb unmodeled accelera- 
tions and known deficiencies in the ROCK4 model, which are 
thought to amount to as much as 4 m error over a 14-day pre- 
diction interval [23] . This strategy was successful for data 
arcs up to about 1 week long. However, for longer arcs of up 
to 2 weeks, a noticeable and systematic degradation in daily 
baseline repeatability occurred with the three-parameter con- 
stant solar coefficient approach. A new approach was adopted 
in which two constant solar pressure coefficients were esti- 
mated, G y and G xz , along with two tightly constrained pro- 
cess noise parameters, G x and G z . With this approach, daily 
baseline repeatability continued to improve as the arc was 
lengthened. 
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As an alternative to the use of process noise on G x and G ZJ 
small colored noise accelerations were introduced for the long 
November 13-24 arcs. These consisted of three parameters 
representing a constant thrust in the directions of down-track, 
cross-track, and altitude. 

Both stochastic solar pressure parameters and stochastic 
thrust parameters gave similar repeatability results for the 
2-week data arc, Fig. 2. The results were insensitive to the time 
constant, r, as long as the value of q was held about constant 
(see Eqs. 4 and 5). For G x and <7 Z , r was varied from 1 to 28 
days with q « 9.3 X 1G~ 6 , and the repeatability results were 
essentially the same. The results for the thrust parameters were 
more sensitive to the time constant. The best results were not 
obtained until the time constant was made less than 2 days 
and the value oiq^lX 10~ 28 km 2 /sec 4 . Similar results were 
obtained when the time constant for the thrust parameters 
was made as short as 0.5 day. A single stochastic thrust in the 
direction of the satellite long track also yielded good repeat- 
ability. With the present data, which consists of relatively 
short arcs at the same time each day, one cannot determine 
whether the effects are due to solar pressure mismodeling or 
other random or systematic accelerations acting on the GPS 
spacecraft. One possibility is that a solar pressure stochastic 
model is physically the correct approach but the stochastic 
fictitious thrust model works as well, as long as the thrust time 
constant is short enough to absorb the daily variations implicit 
in the solar radiation pressure signature. Note that another 
approach to this problem has been proposed in which ficti- 
tious force parameters with 24-hr resonances are estimated to 
remove solar pressure, gravity, and other dynamic errors 
which tend to repeat daily [28] . It is hoped that the sources 
of these forces can be isolated when more data and global 
tracking are available. This may be possible with the CASA 
UNO data set [29] . 

E. GPS 4 Maneuver 

A maneuver was performed on GPS 4 (SV 8) at approxi- 
mately UT 0320 November 20, 1985. In a few hours the space- 
craft state was changed by over 100 km and it was moved to a 
slightly different orbit. This time was in the middle of the 
experiment but outside the data collection interval, which was 
from about UT 1 100-1900 each day. In order to perform con- 
tinuous orbit determination over the entire experiment inter- 
val (November 13-24) for all the satellites simultaneously, we 
had to model and estimate the maneuver. The NSWC param- 
eterized the maneuver as a constant thrust over 5 minutes. We 
used a four-parameter model with three instantaneous velocity 
changes and a time-of-burn parameter, which we refer to as an 
impulsive motor burn. The impulsive motor burn allows for an 
instantaneous change in position and velocity while leaving the 
acceleration unchanged. Let AV be the vector velocity incre- 
ment, with three components AF^, AF C , and AV L , where 


subscripts H, C, andL refer to the local spacecraft coordinates 
of altitude, cross-track, and down-track. Let AT B denote the 
time of burn, e.g., the interval over which the burn is applied. 
Then the vector change in position is 

Ar=i-AVr £ ={aAr| (8) 

The acceleration, a, defined by AV and A T Bi is the equivalent 
constant acceleration that would be experienced over the 
interval A T B . The solution for the maneuver parameters AV H , 
AF C , AF l , and A T B was obtained by iterating over a data arc 
spanning November 18-22. The maneuver solution converged 
after three iterations and showed meter-level agreement with a 
separate solution for the GPS 4 position using only data col- 
lected after the maneuver (November 20-24). This nominal 
representation for the maneuver was further refined in later 
longer-arc high-precision GPS orbit solutions. It is interesting 
to note that the maneuver solution did not converge initially 
unless we constrained G x = G z for GPS 4. 

IV. Assessment of Orbit Accuracy 

Five criteria were used to assess the accuracy of the GPS or- 
bits determined from multiday arcs in November 1 985 and June 
1986: (1) orbit repeatability; (2) orbit prediction; (3) daily 
baseline repeatability; (4) agreement between GPS and VLBI- 
determined baselines; and (5) formal errors from the orbit 
filter. 

A. Orbit Repeatability 

Orbit repeatability indicates the precision and, to some 
extent, the accuracy of the GPS orbits. Figure 3 illustrates 
how orbit repeatability was computed for the November 1985 
experiment. The purpose of orbit repeatability is to compare 
orbits determined independently without any common mea- 
surements and then compute the rms difference over a time 
interval during which no data were used for either of the two 
solutions. Figure 4 shows the mean of the rms computed for 
all seven GPS satellites over a 6-hr interval on November 17. 
Figure 4 also shows the significant improvement attained when 
pseudorange is processed with the carrier phase and stochastic 
tropospheric delay models are used. From the formal errors, it 
appears that the pseudorange contributed little geometric 
strength to the orbit solutions, since the rms scatter of indi- 
vidual measurements was 100 to 300 cm for 6-minute mea- 
surement intervals, due mostly to ground multipath. However, 
when the pseudorange and carrier phase are processed together 
and a common clock is estimated, the pseudorange provides 
a priori knowledge of the clock and carrier phase bias param- 
eters at the several-nanosecond (100-300 cm) level, signifi- 
cantly improving the orbits. 
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GPS 6 and 8, the two satellites with the most data and best 
ground viewing geometry, had formal errors below 1 m for 
most of the November experiment and had significantly lower 
orbit repeatability rms than the other satellites. Figure 5 shows 
the repeatability computed over a 24-hr interval on Novem- 
ber 17 when no measurements were used for either orbit solu- 
tion. Since the two solutions being compared were determined 
from independent data sets (Fig. 3), it is concluded that sub- 
meter orbit precision has been demonstrated for these two 
satellites. 

B. Orbit Prediction 

Orbit prediction is a stringent test of orbit accuracy, since 
the estimated spacecraft position and velocity are mapped out- 
side of the measurement interval to give a predicted satellite 
state. Orbit errors tend to be magnified in this mapping pro- 
cess. The accuracy of the orbit models used for mapping are 
also tested by the orbit prediction test, in addition to the 
accuracy of the satellite ephemerides. Figure 6 shows the pre- 
diction test we applied to our November 1985 multiday-arc 
orbits. The average rms errors for four of the satellites were 
0.7 m, 0.8 m, and 1.7 m in altitude, cross-track, and down- 
track components. GPS 8, which was tracked longer than any 
of the other satellites, had a prediction rms error well below 
1 m, even when mapped more than 3 days into the second arc. 
For the first 6 hours of the prediction interval, the rms error 
was 50 cm or less for all three position components. These 
results are shown in Figs. 7 and 8. 

C. Daily Baseline Repeatability and Agreement 
Between GPS and VLBI Solutions 

To further assess the GPS orbit accuracy determined with 
multiday arcs, we have examined daily baseline repeatability as 
well as agreement with independent VLBI baseline measure- 
ments over continental distances (1000 to 2000 km). For the 
November 1985 experiment, we examined baselines between 
Hat Creek, CA and Fort Davis, TX (1933 km); Mojave, CA and 
Fort Davis, TX (1314 km); and Richmond, FL and Haystack, 
MA (2046 km). Daily repeatability was computed as the rms 
about the weighted mean of the daily baseline solutions deter- 
mined simultaneously with one multiday-arc orbit solution. 
For these baselines in North America with good common visi- 
bility of the GPS, the rms scatter was 0.3 to 2 parts in 10 s of 
baseline length for all vector components. Agreement with 
VLBI was 0.3 to 1.5 parts in 10 s for baselines with the same 
type of GPS antenna at both ends. Figures 9 and 10 show the 
results for the 2000-km baselines. 

For determination of the Hat Creek-Fort Davis baseline, 
Fort Davis, Richmond, and Haystack were held fixed as fidu- 
cial stations. For the Richmond-Haystack baseline solution, 
a separate filter run was made with Hat Creek, Fort Davis, and 


Haystack fixed as the fiducial reference points. Note that in 
the case of the Richmond-Haystack baseline the Hat Creek 
fiducial does not have data in the second half of the 2-week 
data arc, reducing the number of fiducials to two for the 
second half. The data from the first half has already deter- 
mined the GPS positions sufficiently that the lack of a third 
fiducial does not degrade the quality of the solution. Although 
high-quality orbits are a prerequisite for good precision and 
accuracy over long baseline distances, there are other factors 
that can affect baseline accuracy aside from GPS orbits. For 
example, local site vectors between the GPS and VLBI an- 
tennas sometimes are inaccurate by several cm. One such local 
survey error was recently discovered at OVRO, leading to a 
5-cm discrepancy between the GPS and VLBI Mojave-OVRO 
baseline until it was corrected [26] . Therefore, while the daily 
baseline repeatability provides a measure of consistency for 
orbits determined from multiday arcs, agreement with VLBI 
is a measure of overall system accuracy , which depends on a 
number of factors in addition to orbit accuracy. 

When different GPS antennas were used at the ends of a 
baseline, although daily repeatability was excellent, agreement 
with VLBI was worse by 1 to 7 cm, with no apparent depen- 
dence on baseline length. This was noticed in both the Novem- 
ber 1985 and June 1986 experiments. However, baselines with 
the same type of GPS antennas showed good agreement with 
VLBI. Since ephemeris errors tend to scale with baseline 
length, it was hypothesized that these discrepancies were due 
to local phenomena rather than orbital effects. Attention has 
been directed at phase center variations in the antennas, since 
the TI antennas are designed so that in operation the phase 
center variations nearly cancel out between sites that are not 
more than a few thousand km apart [6] . However, the Series-X 
antennas do not have the same phase center characteristics as 
the TI antennas, and the signatures resulting from the several- 
cm phase center variations that have been measured [27] 
could corrupt baseline measurements between unlike antennas. 
Therefore we qualify our high-precision results with the warn- 
ing that measurements between different GPS antennas may 
be much less reliable and may be affected by unpredictable 
effects. 

D. Baseline Repeatability Outside the 
Fiducial Network 

The baselines in North America are fairly well determined 
because they are either inside or near the fiducial network and 
because the current limited GPS constellation by design is op- 
timized for North America, especially the southwest United 
States. The formal errors from the filter are consistent with 
the results in Figs. 7, 8, and 9, predicting precision of 1 to 4 cm 
over these 2000-km baselines. To further test the robustness 
of the multiday-arc GPS orbits, we have determined baselines 
between Richmond and several sites in the Caribbean Sea 
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region occupied during the June 1986 experiment. Figure 10 
shows daily baseline repeatability for Richmond-Grand Turk 
(1049 km) and Richmond-Isabela (1582 km) determined 
from an 8-day orbit fit for June 2-10, 1986. Because the 
Caribbean sites are far to the southeast of both the fiducial 
network and the optimal region for the GPS constellation, the 
formal errors for these baselines are typically 2 to 7 cm, some- 
what worse than those of the North American baselines. De- 
spite the degraded geometry and reduced common visibility, 
the baselines to the Caribbean sites show precision of 1 to 4 
parts in 10 8 . Some of the Caribbean sites were equipped with 
WVRs; for these as well as the sites without WVRs, residual 
tropospheric corrections determined from the GPS data were 
critical in achieving these levels of baseline precision. Since the 
Caribbean sites are considerably more humid than most of the 
North American sites, various strategies for reducing system- 
atic errors due to uncertainties in the wet troposphere cor- 
rection are being studied with this data set. 

V. Conclusions 

It has been demonstrated that submeter GPS orbits can be 
determined using multiday arc solutions with the current GPS 
constellation subset visible for about 8 hours each day from 
North America. Submeter orbit accuracy was shown through 
orbit repeatability and orbit prediction. North American base- 
lines of 1000 to 2000 km in length can be estimated simul- 
taneously with the GPS orbits to an accuracy of better than 
1.5 parts in 10 8 (3 cm over a 2000-cm distance) with a daily 
precision of 2 parts in 10 8 or better. The most reliable baseline 
solutions are obtained using the same type of receivers and 
antennas at each end of the baseline. Baselines longer than 


1000 km between Florida and sites in the Caribbean region 
have also been determined with daily precision of 1 to 4 parts 
in 10 8 . The Caribbean sites are located well outside the fidu- 
cial tracking network and the region of optimal GPS common 
visibility, so these results further demonstrate the robustness 
of the multiday-arc GPS orbit solutions. 

Process noise models have been used in the orbit determina- 
tion filter to minimize systematic errors which can seriously 
affect ephemeris and baseline accuracy. These systematic 
effects include tropospheric delay fluctuations and small, un- 
modeled spacecraft accelerations. The process noise tropo- 
sphere models improved all orbit and baseline solutions, re- 
gardless of length of data arc. Tightly constrained process 
noise representation for part of the solar pressure model sig- 
nificantly improved baseline repeatability for arcs longer than 
1 week; however, an equally effective technique had fictitious 
thrusts estimated stochastically for each GPS satellite. Because 
of the limited ground visibility with the current constellation, 
it is not yet possible to determine whether the accelerations 
are genuinely related to solar radiation pressure or are due to 
other random or systematic forces acting on the spacecraft. 

This demonstration of several-cm accuracy over distances 
of a few thousand km, despite a limited ground tracking net- 
work and a constellation of only seven satellites, proves that 
GPS provides a very powerful relative positioning capability. 
It shows that GPS techniques have the intrinsic data strength 
and robustness needed for DSN high-precision applications, 
as well as low Earth orbiter tracking and crustal motion 
studies. 
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Fig. 3. Orbit repeatability for November 1985 
uses data from Nov. 13, 15, 18 for one solution 
and Nov. 14, 16, 19 for the other. Rms difference 
between the two solutions is computed over 
Nov. 17, during which no data was taken. 


Fig. 1. Locations of ground tracking sites used in the analysis of 
GPS data from the November 1985 and June 1986 experiments. 



Fig. 2. Daily baseline repeatability with 2-week orbit arcs 
(Nov. 13-24, 1985) for the Mojave-Fort Davis 1314-km baseline 
showing dramatic improvement when stochastic force parameters 
are estimated for GPS satellites. 



TROPOSPHERIC TROPOSPHERIC TROPOSPHERIC 

Fig. 4. Rms orbit repeatability over a 6-hr interval on Nov. 17. Rms 
between independent arcs shown in Fig. 3 for altitude, cross-track, 
and down-track components have been averaged for all seven GPS 
satellites. 
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Fig. 5. Submeter orbit repeatability for GPS 6 and 8. Rms difference 
between two independent solutions is computed over a 24-hr 
interval on Nov. 17 when no measurements were used. 
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AND ARC 2 ORBITS 

Fig. 6. Orbit prediction test for Nov. 1 985. Orbits determined from Nov. 1 3-1 9 are mapped ahead, 
and the rms difference is computed with an independent solution determined from Nov. 20-24. 
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Fig. 7. Difference between arc 1 predicted orbit and arc 2 orbit as 
shown in Fig. 6 for GPS 8. Rms is taken over a 6-hr interval. 
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Fig. 8. Orbit prediction difference similar to Fig. 7 except that here | 

the difference and rms are shown for a prediction interval of more * 

than 3 days. 






Fig. 10. Daily GPS baseline repeatability and agreement with VLBI 
for the Richmond-Haystack 2046-km baseline determined with a 
multiday orbit fit from Nov. 13-24, 1985. 


Fig. 9. Daily GPS baseline repeatability and agreement with VLBI 
for the Hat Creek-Fort Davis 1933-km baseline determined with a 
multiday orbit fit from Nov. 13-24, 1985. 
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Fig. 11. Daily GPS baseline repeatability for two baselines (1049 
and 1582 km) to the Caribbean region determined with multiday arc 
orbits from the June 1986 experiment. 
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This report considers two-way coherent Doppler error resulting from phase scintilla- 
tions induced on the uplink by the solar corona . It is shown that this error can be esti- 
mated by taking statistics on the differential Doppler measurements. Typical estimates 
for the error are given for four Sun-Ear th-pro be angles and for integration times ranging 
from 1 second to 1 minute. These results are based on data collected during the 1985 
Voyager 2 solar conjunction. 


I. Introduction 

When a spacecraft is in the same area of the sky as the Sun 
and beyond it, the radio beams to and from that spacecraft 
experience phase scintillations due to passage through the solar 
corona. These scintillations will dominate all other sources of 
error for two-way coherent Doppler measurement when the 
Sun-Earth-probe angle is small. Thus, for those missions that 
feature numerous or long-lasting solar conjunctions, especially 
inner planetary missions, it is important to characterize this 
source of Doppler error. Unfortunately, the relevant param- 
eters of the solar corona are highly variable, and adequate 
statistical characterizations do not exist. It has, however, been 
possible to measure the phase scintillations induced on 2.3- 
and 8. 4- GHz downlinks by the solar corona. It is in fact possi- 
ble to make this measurement using any Sun-encountering 
spacecraft with two downlinks of different but related fre- 
quencies. The phase scintillations induced on the downlinks, 
once measured, can be removed from the Doppler phase 
record. The phase scintillations induced on the uplink, on the 
other hand, remain. These cannot be measured or removed 
unless two simultaneous uplinks are available. 

This report considers two-way coherent Doppler error 
resulting from phase scintillations induced on the uplink by 


the solar corona. This error is estimated by taking statistics 
on phase scintillations induced on the downlinks of Voyager 2 
during a 1985 solar conjunction. 

II. Measuring Phase Scintillations Induced 
on the Downlinks 

Whenever a two-way coherent Doppler measurement is 
being performed, the downlink frequencies f x and f 2 are 
related to the uplink frequency f Q by the transponding ratios 
G x and G 2 and the relative velocity of the spacecraft and the 
deep space station. In addition, there are noise terms that 
represent the phase scintillations picked up in transit through 
the solar corona. These frequency relationships may be ex- 
pressed by 

G. v 2 v 2 

f. = G l f 0 (l- 2 p/c) + -r L +-f 4 ri ' = 1.2 ( 1 ) 

J 0 J 0 i 

The nonrelativistic Doppler shift has been indicated here; it is 
proportional to the ratio of the range rate p to the speed of 
light, c. 
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The parameter , in units of Hz 2 , represents the level of 
phase scintillations induced on the uplink by the solar corona. 
It depends only on the physical properties of that part of the 
corona through which the uplink radio beam passes. It is an 
implicit function of time. By taking the integral of the square 
of the local plasma frequency along the uplink ray path, then 
taking a time derivative and dividing by twice the speed of 
light, one could calculate v*. Unfortunately, the local plasma 
frequency is generally not known. The parameter is the 
downlink analog of y 2 . 

Using Eq. (1), we can obtain an expression for : 


= G Jo -(^i/^) 2 ]" 1 [/i -«V g 2 )U 


( 2 ) 


The phase scintillations induced on the downlinks contribute 
a frequency noise term that can be identified if the differential 
Dopper shift, f x - ( G l /G 2 ) / 2 , is measured. Once identified, 
this frequency noise term is easily subtracted out of the Dop- 
pler record. At this point, the only error due to solar corona 
remaining in the Doppler record is the term involving . The 
Doppler error e, in velocity units, is 


e 


2 f, 


(v*) T 

2 u T 


( 3 ) 


The brackets < *) T indicate a time average over an integration 
time F. 


The statistics of v £ and are the same. This is in fact the 
key to being able to estimate the error e. In the writing of 
standard deviations a(* ), then, the subscripts u and d may be 
ignored. 


cr(e) = — o((v 2 ) ) (4) 

2 f 0 2 


The required coronal statistics can also be obtained from a 
pair of one-way downlinks of different but related frequencies. 
This is, in fact, how the data appearing in this report were ob- 
tained. In this case, Eqs. (4) and (5) still hold, but some terms 
in Eq. (5) need proper interpretation. The term G l f 0 is the 
frequency of the first downlink, and the term GJG 2 is the 
ratio of the downlink frequencies. 


III. Voyager 2 Results 

During the 1985 solar conjunction for Voyager 2, differen- 
tial Doppler data were collected from a pair of one-way down- 
links. The downlinks originated with an ultrastable oscillator 
aboard the spacecraft, and the 64-m subnet was used for recep- 
tion. Estimates were made of what the observed level of phase 
scintillations would do to a two-way coherent Doppler mea- 
surement. These estimates were obtained by applying the dif- 
ferential Doppler statistics to Eqs. (4) and (5). The frequency 
of the first downlink was approximately 2296 MHz. The ratio 
of the downlink frequencies, GJG 2 , was 3/11 . 

The standard deviation of the two-way coherent Doppler 
error, as expressed in Eq. (4) and based on Voyager 2 differen- 
tial Doppler measurements, has been compiled in Table 1 and 
plotted in Fig. 1. The statistics have been calculated for inte- 
gration times ranging from 1 second to 1 minute. For the 
longer integration times, the error is less. Four example Sun- 
Earth-probe angles have been included. It must be understood 
that the properties of the solar corona are highly variable and 
that the results shown in Table 1 and Fig. 1 do not represent 
a characterization of the error as a function of Sun-Earth- 
probe angle. Such a characterization must be based on a larger 
set of data. 

The Doppler error due to solar corona can be reduced by 
using a higher frequency on the uplink. For 7. 2- GHz and 
34.3-GHz uplinks, the errors shown in Table 1 and Fig. 1 are 
divided by 1 1.5 and 263, respectively. For some missions, this 
is an important advantage for the higher frequencies. 


As suggested by Eq. (2), the required coronal statistics can be 
obtained by taking a statistical measure of the differential 
Doppler shift, viz., 


o((v\) = G 1 f 0 


1 -(gjg 2 ) 2 


~ (GjG 2 )f 2 ) T ) 


( 5 ) 


IV. Conclusions 

This report has explained how phase scintillations induced 
on the downlinks can be measured. It has been shown that 
taking statistics on these measured phase scintillations leads to 
estimates of the Doppler error due to uplink phase scintilla- 
tions. Typical estimates for the error have been calculated 
based on differential Doppler data collected during the 1985 
Voyager 2 solar conjunction. 
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Table 1. Two-way coherent Doppler error for 2.1 -GHz uplink and 
dual-frequency calibrated downlink* 


Integration 
time, sec 

Doppler error (mm/sec) for several 
Sun-Earth-probe angles 

1.3° 

1.7° 

2.1 c 

2.5° 

1 

172.0 

137.2 

99.4 

41.3 

5 

161.7 

98.2 

65.7 

31.2 

10 

152.6 

78.1 

54.2 

27.2 

15 

142.7 

67.1 

47.9 

24.3 

20 

137.1 

64.7 

45.3 

23.2 

25 

131.7 

57.9 

40.2 

22.8 

30 

127.7 

53.5 

41.1 

21.7 

35 

123.9 

52.9 

37.9 

21.3 

40 

122.3 

51.5 

37.2 

19.4 

45 

117.7 

48.7 

33.9 

19.6 

50 

117.9 

47.6 

34.0 

20.5 

55 

116.5 

45.9 

34.1 

19.2 

60 

112.4 

43.5 

33.4 

18.4 


♦These results are based on phase scintillation measurements made 
using the Voyager 2 spacecraft during its 1985 solar conjunction. 
The time and place at which the measurements were made are 
indicated below. 

1.3°: 2030 to 2135 on 12-8-85 at Goldstone 
1 .7° : 0440 to 0640 on 1 2-12-85 at Canberra 
2. 1° : 0435 to 0545 on 12-8-85 at Canberra 
2.5°: 2015 to 2120 on 12-12-85 at Goldstone 
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Dynamic models for the various functional elements of the 70-m antenna axis servos 
are described. The models representing the digital position controller , the linear and 
nonlinear properties of the physical hardware , and the dynamics of the flexible antenna 
structure are encoded in six major function blocks . The general modular structure of the 
function blocks facilitates their adaptation to a variety of dynamic simulation studies. 
Model parameter values were calculated from component specifications and design data. 
A simulation using the models to predict limit cycle behavior produced results in excel- 
lent agreement with field test data from the DSS 14 70-m antenna. 


I. Introduction 

The recent acquisition of a microcomputer workstation and 
a software package for modern control system analysis and 
simulation has enabled combining the linear dynamics of the 
antenna structure, the nonlinearities of friction, and the quan- 
tized, sampled data properties of the control algorithm in a 
single simulation model. The dynamic simulation program pro- 
vides a building block approach to modeling complex systems 
such as the 70-m antenna axis servos. The models described 
here were developed from previous linear models [1] , [2] in 
generalized building block forms. The form of the modularity 
was chosen to maintain generality and to facilitate future 
expansion or simplification of models for selected parts of the 
system. 

The 70-m servo models described here are organized accord- 
ing to function into six major blocks which are shown (Fig. 1) 
interconnected to represent the overall axis-position servo. The 
overall model includes the salient properties of the control 
algorithm, the digital-to -analog converter, the electronic con- 


trol amplifier, the hydraulic servovalve, the hydraulic motor 
with associated friction, the antenna structure and pedestal, 
and the axis encoder. 

II. Dynamic Model Development 

A. Position Loop Control Algorithm 

The structure of the control algorithm is discussed in detail 
in [1]. The estimator and gain equations for the computer 
mode from [1] are repeated here to illustrate the development 
of the discrete system equations representing the algorithm 
block. 

E(n + 1) = QE(n) + rU(n) + LY E (n + 1) (la) 
for E 2 through E e , and 

*,(« + !) = ^OO+tfjOi) -*(«) (lb) 
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U(n + \) = - KE(n + 1) +NR(n + 1) 


( 2 ) 

where E(n + 1) is the state estimate column vector of [1], 
[E x E 2 E 3 . . .E e \ corresponding to time interval n + 1. In 
[1] the R(n) term in the equation for the integral error esti- 
mate,^ , was omitted in error. 

In practice, the antenna servo controller evaluates Eq. (la) 
in two steps in order to minimize the computation delay 
within the servo loop, and implements Eq. (lb) to simplify 
the computation of E x and conserve computing time. Eq. (lb) 
is equivalent to Eq. (la) for the special case where the first 
row of $ is [1 1 0 0 0 0] , the first elements of T and L are 
zero, and R(n) is subtracted. Eq. (la) can thus be extended to 
the more general form to include E x 

E(n+ 1) = $E(n) + VU{n)+LY E (n +l)+MR(n) (3) 

where M is the column vector [<J>j 2 0 0 0 0 0]. 

Next, substituting 

Y E (n + 1) = r(« + i)-^(^(«) + rc/(^)) 
and 

U(n) = -KE{n) + NR(n) 

into Eq. (3) yields the familiar form for the estimator 
E(n + 1) = [l-LH][®-rK]E(n)+LY(n + l) 

+ [l-LH]YNR(n)+MR(n) (4) 

With the use of Eq. (4), the controller output U and the posi- 
tion estimate E 2 can be expressed in a compact state space 
form for a system having three inputs and two outputs defined 
by Eqs. (2) and (3) 


~ E{n + 1)" 


A 

b" 


" E{n) ‘ 

U{n + 1) 

= 

C 

D 


Y(n + 1) 
R(n) 

5 (« + 1 ) 


HA 

HB 


R(n + 1) 


where 

B = [L (I- LH)TN + M 0] 


C = -KA 

D = [-KB+ [0 0 N]] 

Figure 2 illustrates the control algorithm block with inputs 
which are the encoder output, Y, and the current position 
command, R; its outputs are the rate command, U , and the 
encoder position estimate, E x . The two blocks shown are 
standard building blocks provided by the simulation program. 
The previous rate command, R(k), is derived from the unit 
delay. The discrete time equations represented by the A, B, C, 
and D matrices are evaluated at 50-msec intervals by the State 
Space block. 

B. Electronic Component Models 

The digital-to-analog converter in Fig. 1 is represented by a 
standard building block which quantizes a continuous input 
function. 

The axis encoder is modeled by a standard quantizer build- 
ing block from the simulation program catalog as shown in 
Fig. 1. The quantization level corresponds to 360 degrees per 
2 20 encoder increments. 

The amplifier model block shown in Fig. 3 represents the 
dynamics of the hardware rate and acceleration limiters, the 
rate loop compensation networks, and the valve driver ampli- 
fier. Inputs are the rate command from the digital-to-analog 
converter and the motor rate, and the outputs are the current 
to the hydraulic valve and the voltage at the average tachom- 
eter circuit node. Parameter values are calculated from the 
component values in the schematic diagram, 1 and the proper- 
ties of the valve coil. Details of these calculations are included 
in the Appendix. 

C. Hydromechanical Component Models 

The hydraulic valve model block is shown in Fig. 4 where 
the input is valve coil current and the output is no-load volu- 
metric flow. The flow reduction due to the hydraulic pressure 
of the load is incorporated in the damping term of the motor 
model. This form of modeling simplifies the block intercon- 
nections by eliminating a pressure feedback path from the 
motor block to the valve block. The valve flow versus current 
is modeled by a simple dynamic lag followed by a hysteresis 
function and a deadzone. The deadzone corresponds to the 
underlapped condition of the valve spool face and the hystere- 


*JPL Drawing J947 98 7 ID, Schematic Diagram, Analog D.W.B., (inter- 
nal document), Jet Propulsion Laboratory, Pasadena, California, 1987. 
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sis results from friction associated with the spool motion. The 
resulting flow characteristic of this model, shown in Fig. 5, 
compares well with those from actual valve tests performed by 
the manufacturer. 

Figure 6 shows the model of the hydraulic motor where the 
inputs are the no-load hydraulic flow and the load torque at 
the antenna bullgear. The outputs are the rate at the bullgear, 
the rate at the motor shaft, and the differential hydraulic pres- 
sure. The model incorporates the performance equations 
described in [2] and a more accurate model [3] for the fric- 
tion associated with the motor and the gear reducer. 


D. Flexible Structure Dynamics 

A block diagram of the structure dynamic model is shown 
in Fig. 7. The model includes the gearbox stiffness, the residual 
structure inertia, axis damping and Coulomb friction, and state 
space models for the flexible modes of the structure and the 
antenna pedestal. The residual inertia represents that part of 
the total axis inertia that is not associated with any of the 
flexible modes. The inputs correspond to the hydraulic motor 
rate and an external disturbance torque. The first three out- 
puts correspond to the angular positions at the bullgear, the 
axis encoder, and the intermediate reference assembly (IRA), 
respectively. The fourth and fifth outputs correspond to the 
axis reaction torque which is reflected back to the motor, and 
the encoder rate. It should be recognized that the bullgear and 
IRA positions are in absolute, or inertial, coordinates while the 
encoder position and rate are measured relative to the pedestal 
displacement. 
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and where x represents the state vector and jc its time deriva- 
tive, and 

N = number of flexible modes 


The model is based on the equations of motion described in 
[2] with the addition of axis damping, friction, and pedestal 
dynamics. A block diagram representation of the structure 
flexible mode dynamics is shown in Fig. 8(a) where the input 
is the bullgear angle and the outputs are the IRA angle and the 
net reaction torque reflected to the bullgear. The correspond- 
ing state space equations are given by Eq. (6). The flexible 
alidade structure model, for the elevation axis, is shown in the 
block diagram of Fig. 8(b) and the corresponding state space 
equations are given by Eq. (7). The pedestal dynamic equa- 
tions have the same form as those of the alidade structure used 
in the elevation model in Eq. (7). Numerical values of the 
structure parameter are listed in Table 1. 
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C = [o 1 0 o] D = [ o] 

with 

^ = L ^ 

^A2 — ^A2 W A2 

E. Friction 

The axis friction representation shown in Fig. 7 is subject 
to the limitations described in [3] . At this stage in the model 
development the total friction is lumped at the motor shaft as 
the actual distribution between motor shaft and antenna axis 
is unknown. Presumably, the distribution of friction to both 
sides of the gearbox stiffness could have a noticeable effect on 
dynamic behavior. 

III. Model Tests and Applications to 
Simulations 

The model was tested by using a feature of the simulation 
program that produces the linear system matrices representing 
the linearized model. The linear system matrices provide for a 
convenient cross check with other linear analysis methods thus 
assuring proper interconnect, feedback polarity, and parameter 
values. The Eigenvalues of the rate loop portion of the model 
of Fig. 1 were thus computed and results compared to three 
decimal place accuracy with those listed in [2], Due to numeri- 
cal condition deficiencies of the linear system matrix produced 
by the program, attempts to determine the transfer function 
zeros led to unreasonable results. This condition deficiency 
was overcome by scaling the relevant inertia and stiffness 
parameters according to the square of the gear ratio (see 
Fig. 6), and zeros in good agreement with [2] were thus* ob- 
tained. The condition deficiency will be seen to affect only the 


derived linear system matrix and will not impair accuracy of 
simulation runs. Following this cross check process the model 
was restored to the unnormalized form shown in Figs. 6 and 7. 

A simulation of position loop limit cycle behavior was 
performed to compare the performance of the overall model 
including the new friction model with results from tests on the 
actual antenna. Results from the simulation shown in Fig. 9 
are remarkably similar to those from the antenna test of Fig. 
10. In both the simulation and the hardware test, the limit 
cycle was initiated by a small (3 encoder bits) position input 
step change. The simulation was performed at a stage prior to 
the development of the latest models for the discrete time con- 
trol algorithm, the amplifier, and the hydraulic valve, so linear 
dynamic equivalents were substituted. For the algorithm, the 
proportional-plus-integral-plus-derivative (PID) linear feedback 
equivalent was used. The model structure and parameter values 
were otherwise identical to those described above. 

IV. Summary and Conclusions 

Modular dynamic models for the 70-m antenna axis servos 
have been described. Numerical cross checks of Eigenvalues 
and transfer function zeros indicate a consistency between the 
model and previous linear analysis methods. Comparisons of 
model-based simulation results with actual field test results 
indicate excellent modeling accuracy. 

The small discrepancies between the simulation and field 
test results are most likely the result of differences between 
the modeled and actual friction parameter values, and the 
presence of a small (150 psi) bias torque effect in the actual 
antenna. In future work the model and the simulation program 
should be extremely useful in two ways: (1) as an adjunct to 
the design of more robust systems, and (2) as a hardware diag- 
nostic aid that relates specific hardware out-of-tolerance condi- 
tions to abnormal field test measurements. 
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Table 1. Parameters for 70-m A Z and EL axis servos 


FLEXIBLE MODES 3 

GEARBOX STIFFNESS, K G , ft-lb/rad 

Stiffness, K, ft-lb/rad 

K g AZ = 2.1654E11 

KAZ= [2.238 4.516 3.651 1.233 0.564] T - 1.E09 

K g EL = 3.0759E11 

KEL= [22.61 6.564 1.566 3.406 0.699] T * 1.E09 

JBINV AZ = 1/J B AZ 

Squared natural frequencies, cj, (rad/sec) 2 

JBINV EL = 1/J B EL 

co AZ=[ 63.47 69.09 97.81 189.0 285.68] 

AXIS DAMPING, ft-lb/rad ian /sec, (equiv 1000 psi/deg/sec) 

co EL= [218.77 313.04 411.12 476.72 656.85] 

2.13E08 

Transformation coefficients, a dimensionless 

GEAR RATIO (MOTOR:AXIS) 

a AZ= [0.1331 0.2767 0.1169 0.0099 0.0376] T 

= 28730 (BOTH AXES) 

aEL= [0.2939 0.0977 0.0266 0.0309 0.0050] 1 

MOTOR DISPLACEMENT (total 4 motors), V, in. 3 /rad 

ALIDADE STRUCTURE, ELEVATION AXIS ONLY 

V = 1.528 

Inertia moments, ft-lb-sec 2 , referred to the EL axis 

VALVE GAIN, K v , in. 3 /s/mA 

JA1 = 1.197E07 

K v = 38.3 

JA2 = 1.098E08 

Squared natural frequencies, u>, (rad/sec) 2 

HYD COMPRESSIBILITY, C, in. 3 /p si 
C = . 00314 

ojAI = 4600 
gj A2 = 1026 

PEDESTAL STRUCTURE, AZIMUTH AXIS ONLY 

VALVE DAMPING RATIO, D/C, sec -1 
D = 0.60 

MOTOR INERTIA (total 4 motors), J M , ft-lb-sec 2 
J M AZ = 1.00 
J M EL = 0.664 

Inertia moments, ft-lb-sec 2 , referred to the AZ-axis 
JP1 = 2.105E08 
JP2 = 1.74E08 

Squared natural frequencies, cj, (rad/sec) 2 

FRICTION, INERTIA, SAMPLE INTERVALS FOR FRICTION 
BLOCK, FRICTION EQUIVALENT TO 350 psi OF AP, STATIC 

cj PI = 3.093E04 

FRICTION EQUIVALENT TO 420 psi 

cj P2 = 4.063E04 

BULLGEAR RESIDUAL INERTIA, J B , ft-lb-sec 2 

FRC = 44.57 
STC = 53.48 

J B AZ= 1.4965E08 

JMFT = [J M AZFRC 0.005] 

J B EL = 8.7989E07 

FRCS = [FRC STC] 


“Calculated from [2] , Table 5 using Kj = (Jj/J b ) cj. 2 J b N 2 with gear ratio N = 28730 for either axis. 
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Fig. 1. Azimuth/ Elevation servo simulation diagram. 
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Fig. 2. Control algorithm simulation diagram. 



Fig. 3. Rate loop amplifier simulation diagram. 
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Fig. 6. Azimuth motor simulation diagram. 
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Fig. 7. Azimuth structure simulation diagram. 
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Fig. 8. 70-m simulation block diagram: (a) flexible modes and (b) alidade structure. 
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Fig. 9. Simulation test results for an azimuth limit cycling condition: (a) differential pressure; 
(b) axis rate; (c) hydraulic valve current; (d) rate command; and (e) encoder angle. 
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Appendix 

Derivation of Electronic Circuit Transfer Functions 


This appendix describes the analytical methods and detailed 
calculations used to derive the dynamic transfer functions of 
the electronic circuit portions of the 70-m axis servo loops. A 
number of simplifying approximations employed in the origi- 
nal design and network synthesis process are also described. 
The transfer functions are described in terms of a group of 
function blocks representing simplified equivalent circuits of 
portions of the analog circuit board and the external rate- 
meter. The basic blocks consist of a tachometer combining 
network and filter, a tachometer lead network, a compensa- 
tion amplifier, a valve driver amplifier, and a rate and accelera- 
tion limiter. 


I. Tachometer Combining Network and 
Filter 

The tachometer network and filter properties are derived 
from the schematic diagram (see Footnote 1) and the external 
ratemeter components. The applicable part of the schematic 
and the ratemeter is shown in Fig. A-l(a). The four tachometer 
voltage divider networks, R34, R35, Cl 7, etc. are omitted 
from Fig. A- 1(a) because by virtue of their high impedance 
relative to the tachometer source resistances, these networks 
have negligible effect on the overall transfer functions. The 
V AT symbol designates the “average tachometer” circuit node 
voltage, a significant node because it serves as a calibration 
reference for transfer function calculations. The ratemeter 
adjust potentiometer is adjusted in the field to compensate for 
disconnection of one or more tachometers, and also for scale 
factor variations among individual tachometers. Because this 
adjustment is in a shunt circuit path, it simultaneously corrects 
the rate loop gain and the voltage at V AT , with adjustment of 
the high and low rate ranges of the meter circuit. 

The voltage scale factor at V AT to satisfy the calibration 
condition is calculated from the known full scale meter cur- 
rent, (50 juA), the meter circuit resistance consisting of fixed 
resistor R60 and the meter coil resistance R m , and the full 
scale rate (0.25 deg/sec). The average tachometer circuit node 
voltage constant is thus 


V AT 


(R60 +/?)/. 


m' FSM 


Rate 


fs 


_ (121 +0.660) .05 
at 0.25 

= 24.33 volts/degree/sec axis rate 

= 0.04852 volts/rad/sec of motor rate 

This value is as accurate as the combined accuracy of the R60 
resistor (1 percent), the meter movement calibration (2 per- 
cent), and the field calibration process (estimated 3 percent) 
which is sufficient for the present purpose. A small error 
(4.9 percent) in the low rate range calibration results from the 
difference of the ratio of (R60 + 660)/(R61 + 660) from the 
desired 10:1. This error could be diminished by padding R61 
with 220 k£2. 


The network of Fig. A- 1(a) is reduced to the equivalent of 
Fig. A- 1(b) by replacing the four inputs with the single equiva- 
lent voltage source V r , replacing the R54, R57, R58, R59, 
R62 combination with the series resistor R ST , and replacing 
the meter circuit with the shunt resistor R M . Calculation of 
the shunt resistor R M requires a knowledge of the setting of 
the 100 K ratemeter adjustment potentiometer. This is accom- 
plished indirectly through use of the known value of V AT de- 
rived from the ratemeter adjustment criteria described above 
and the tachometer scale factors. Thus, representing the sum 
of the shunt conductances by G ss and with G ST = R~^ : 


J ss 



which leads to 



where 


R st = (R56 -1 + R57 -1 


-(R63 + R64) -1 -(R65 + R66) 


+ R58" 1 + R59 -1 ) -1 +R62 


-l 


With V T = 187.5 and 222 volts/deg/sec for azimuth and 
elevation, respectively, and with 


R56 = R57 = R58 = R59 = 8.25 kft 
R62 = 56.2 kf2 


With R60 = 121 kfi,7? m = 660 £1 


R63 = 40.0 kf2 
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R64 = 51.1 kf2 
R65 = 909 k n 
R 66 = 51.1 kf2 

The equivalent meter circuit resistance becomes 
R m = 9.73 kT2 for azimuth 
R m = 7.847 k£2 for elevation 

The 56.2 K resistor R14 provides part of the additional shunt 
conductance required for elevation. 


The network poles and zero are calculated from general 
expressions derived from the circuit loop equations using 
Cramer’s method with expansion of the determinants. The 
computation is simplified to a two-pole, single-zero determina- 
tion by partitioning the circuit to delete C42, R65, and R66 
from the circuit. This approximation is justified since the 
response zeros correspond to the parallel resonance of R65 
and C42 and to the series resonance of the R63, R64, C40 
branch and are thus unaffected by the partitioning. The effect 
of partitioning on the response poles is small because of the 
large ratio of R66 to R M . The exact location of the tachom- 
eter filter poles is also of secondary importance to control 
dynamics analysis. The circuit equations may be derived from 
any consistent set of loop currents or node voltages. A typical 
set of loop current equations is 


R x + R63 + R64 - R64 

- R63 


h 


V 1 

-R64 R64 + (sC40) -1 

- (sC40) _1 


h 

= 

0 

- R63 - 0C40)- 1 

R63 +(sC40r x +(sC41)- 1 


V 


0 


where R l - + 1 and V j V T R M + ^Af) 

Expansion of the determinant yields the following quadratic 
equation in s : 

s 2 + [( R- 1 + R63 -1 ) C41 -1 + (R63 -1 + R64 -1 ) C40 _1 ]s 

+ (R 1 + R63 + R64) (R l R63 R64 C41 C40)" 1 = 0 

the roots of which are the circuit poles. Expansion of the nu- 
merator cofactor yields the single zero 

Z1 = - (R63 -1 + R64 -1 ) C40 -1 

Numerical evaluation for the azimuth axis results in 
Z1 = -297 
PI = -258 
P2 = -1006 

The transfer function relating the average tachometer node 
voltage to the motor rate in radians/sec becomes 

V at _ -0.4852 P2 _ 488.1 

“©T (s -P2) (s + 1006) 


Because of the near cancellation of Z1 and PI, they are 
omitted from the model and the circuit is approximated by 
the single pole, P2. As an alternative to the quadratic solution 
above, a convenient approximation for the poles may be em- 
ployed whereby 

PI = - (R63 -1 + R64 -1 ) C4CF 1 
P2 = -(R" 1 + R63 -1 ) C41 -1 
This approximation yields -297 and -967 for PI and P2. 


II. Tachometer Lead Network 

The calculation of the tachometer lead network pole and 
zero is simplified by partitioning the circuit and replacing the 
tachometer network by an equivalent source resistance, R s . 
The lead pole and zero frequencies thus become 

PL1 = - (R s + R65 + R66) [(R^ + R66) R65 C42] _1 
ZL1 = - (R65 C42) -1 

where R s = [R~* + R' 1 + (R63 + R64)" 1 ] - 1 . 
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Numerical results are 

PL1 = -82.38 for azimuth 
PL1 = -84.01 for elevation 
ZL1 = -5.00 for both axes 


III. Rate Loop Compensation Amplifier 

The simplified schematic circuit diagram of the rate loop 
amplifier and the tachometer lead network is shown in Fig. A-2 
where R s is the network source resistance discussed earlier. 
Using the infinite summing junction gain approximation, the 
high frequency voltage gain of the circuit is the ratio of feed- 
back to input impedances where the reactances of the capaci- 
tors are zero. This high frequency gain is subsequently cascaded 
with the zero/pole ratios of the input and feedback networks 
to derive an overall transfer function. Thus 

V ra _ R53 

Kir " (*5 + R66 > 


IV. Valve Driver Amplifier 

The equivalent circuit of the valve driver amplifier is shown 
in Fig. A-3 where v ar is the input voltage from the rate ampli- 
fier and I v is the output current in the hydraulic valve load. 
The Ql, Q2 complementary transistor emitter followers are 
represented by the unity voltage gain block. Using the infinite 
gain approximation for the operational amplifier, neglecting 
the gain-bandwidth product, and including the phase inver- 
sion of the op-amp, the circuit transconductance becomes 

7 k -1 _ -1217 

V RA (R13 R43 C18) (s - PV1) (s + 303) 

where PV1 = (R36 C18)" 1 = -303. 


V. Rate and Acceleration Limiters 

The equivalent circuits of the rate and acceleration limiters 
are shown in Fig. A- 4 where Vrc represents the rate input 
command voltage from the external digital to analog converter 
in the antenna servo controller and V RL represents the limiter 
output. 


The circuit pole and zero corresponding to the feedback net- 
work are derived using Cramers method of solution of the 
circuit loop equations applicable to the infinite gain approxi- 
mation. Solution of the equations yields for the pole and zero 
frequencies 



The voltage scale factor at Vrc is derived from the equi- 
librium condition where a Vrc input command is opposed by 
a tachometer input, V AT (see Figs. A-l and A-2) such that their 
difference results in a value of Vra sufficient to produce the 
desired rate. This difference is inversely proportional to the 
negative loop gain of the rate loop at DC, K DC . Thus, using 
R38/R23 as an approximation to the DC transfer function of 
the limiter circuit, 


ZR1 = -C31" 1 [R53- 1 + (R52 + R50R51)(R50 + R51)]' 1 
Numerical results are 


PR1 = -0.238 s' 1 
ZR1 = -4.47 s' 1 

The voltage transfer function of the tachometer lead network 
and the loop compensation amplifier thus becomes 


Kir -Vrc™ 

(R65+R66) (R23R15) 


-K V 

DC AT 

(R65 + R66) 


from which 


RC 



(R65 + R66) 


with the loop gain K DC = 40 and Vrc = 19.48 volts/deg/sec. 


Vra _ -R53 (s -ZL1) (s - ZR1) 

V aT (R s + R66) (s - PL1) (s - PR1) 


Normal variations of the rate loop gain about the typical 
value of 40 will result in errors of negligible proportion relative 
to the uncertainties of tachometer and hydraulic valve gains. 


-7.61 (s + 5.0) (s + 4.47) 
(s + 0.238) (s + 82.4) 


The acceleration amplifier U7 with feedback network R32 
and R33 is equivalent to a voltage gain with a single real 
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pole resulting from the gain-bandwidth product of the LF356 
operational amplifier. Neglecting terms in the operational 
amplifier gain, 1/A, the closed-loop voltage gain and pole fre- 
quency become 


V. , = R 1N C24 V BI 

AL IN clt RL 


where R 1N is the actual input resistance, R39 plus the adjusted 
value of R37 


V al _ R33 

V RC ~ R 32 


r in V al 


C24 7 v *l 


-1 


Substituting the acceleration limit threshold voltage, 7.50, for 
P AL = - 2 wGBW , — V al , 19.48 volts/sec per deg/sec multiplied by 0.2 deg/sec 2 for 

d/dtV RL> 


With R32 = 1 .0 K, R33 = 1 .0 M, and GBW = 1 .0 MHz 


R in - 7.50(19.48 • 0.2 C24)' 1 = 1.925 MJ2 



1001 


from which the center value of R36 is 325 kf2 for an accelera- 
tion limit of 0.2 deg/sec 2 . 


P. . = -6280 

AL 


Neglecting the high frequency pole, Pal’ the transfer func- 
tion from the rate command to Val becomes 


The acceleration voltage limiter threshold, Yalt , is the sum 
of the forward and zener voltages (0.7 + 6.8) of the 1N5526 
Zener diodes, CR1 and CR2. 

The action of the rate limiter is represented in Fig. A-4 by 
the limiter block in parallel with capacitor C24. Typical set- 
tings of the adjustable limiter correspond to a rate limit of 
0.25 deg/sec which, using 19.48 volts/deg/sec, equals a 4.87 
volt limit at V RL . The actual setting of the acceleration limit 
adjustment R37 can be calculated from the nominal compo- 
nent values and an assumed adjustment to 0.20 degrees/sec 2 . 
Using the equation for the integrator transfer function 


V AL 


■Hocts)) (^fi) 


©, 


RC 1 + 


R38 


(R23 + R38) 


i+ HW c24s)-1 


which, due to the high value of loop gain, can be approxi- 
mated by 


V AL 

®RC 


= 19.48 R jn C24 s = 38.50 volts/degree/sec 2 
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(b) 


C42 
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SUMMING 
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Fig. A-1. Tachometer combining network: (a) simplified schematic diagram and (b) equivalent circuit. 



Fig. A-2. Rate amplifier equivalent circuit. 
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A new method of modeling friction forces that impede the motion of parts of dynamic 
mechanical systems is described. Conventional methods in which the friction effect is 
assumed a constant force , or torque , in a direction opposite to the relative motion , are 
applicable only to those cases where applied forces are large in comparison to the fric- 
tion , and where there is little interest in system behavior close to the times of transitions 
through zero velocity. This article describes a new algorithm that provides accurate deter- 
mination of friction forces over a wide range of applied force and velocity conditions. 
The method avoids the simulation errors resulting from a finite integration interval used 
in connection with a conventional friction model , as is the case in many digital computer- 
based simulations. The new algorithm incorporates a predictive calculation based on ini- 
tial conditions of motion, externally applied forces, inertia, and integration step size. The 
predictive calculation in connection with an external integration process provides an accu- 
rate determination of both static and Coulomb friction forces and resulting motions in 
dynamic simulations. Accuracy of the results is improved over that obtained with conven- 
tional methods and a relatively large integration step size is permitted. A function block 
for incorporation in a specific simulation program is described. The general form of the 
algorithm facilitates implementation with various programming languages such as Fortran 
or C, as well as with other simulation programs. 


I. Introduction 

Recent interest in certain limit cycle oscillatory modes of 
operation of the 70-m antenna at DSS 14 has intensified the 
need for dynamic analysis and simulation of the axis servos. 
Limit cycle oscillations of phyiscal positioning systems, such 
as the antenna axis servos, result from nonlinearities associated 
with the position sensors and the control actuation devices. To 
support limit cycle investigations it is necessary to model and 
simulate all the identified nonlinearities in the system. Because 


of the large magnitude of axis friction, which equals roughly 
20 percent of the maximum available control effort, accurate 
modeling of friction effects is of critical importance. 

The basic physical laws of friction are discussed in numer- 
ous textbooks on mechanics and are briefly summarized here. 
First, the friction force between two bodies lies in the tangent 
plane of the contact point between the bodies. In the absence 
of relative motion, its magnitude is less than or equal to the 
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product of the normal force between the bodies and a con- 
stant coefficient of static friction. Relative tangent plane 
motion between the bodies cannot commence until an exter- 
nally applied force exceeds the maximum magnitude of the 
friction force. Second, the Coulomb (as opposed to viscous) 
friction in the presence of relative motion between the two 
bodies, the friction force equals the product of the normal 
force and a coefficient of friction which may be different from 
the static coefficient and is in a direction opposite to the 
relative motion. 

In those applications where the normal force is constant 
and static friction forces are of no great concern, the friction 
force can be modeled by a constant force directed opposite 
the relative motion. This conventional model combined with 
an inertia is illustrated in transfer function form in Fig. 1 
where motion is restricted to a single coordinate. The non- 
linear function block in Fig. 1 has three possible outputs: a 
unit amplitude with algebraic sign the same as that of the 
velocity; x , (when x is nonzero); and zero output for zero x. 
The constant of multiplication, F c , in the constant block is 
the product of the normal force and the friction coefficient. 
The net input force to the integration block is thus equal to 
the difference between the applied force and the friction 
force. 

By inference, the behavior of the model of Fig. 1 can be 
predicted for a number of simple cases. First, when the applied 
force is zero and the initial rate of motion is nonzero, the 
motion will decay to zero under the influence of friction. 
Next, when the applied force exceeds the magnitude of the 
friction, the motion will accelerate in direct proportion to the 
difference between the applied and friction forces. For these 
two cases the model is seen to provide a reasonable representa- 
tion of motion in the presence of friction. Examining next the 
case where the applied force is less in magnitude than the fric- 
tion and the initial rate is zero, the model is seen to deviate 
from the physical law because the modeled friction force is 
zero for the zero rate condition and an erroneous acceleration 
of the inertia results. 

Assuming the conventional model is evaluated using fixed 
step size numerical integration, the zero rate case above pro- 
duces a friction force which, because it exceeds the applied 
force, causes a rate reversal and leads to a sustained oscillatory 
process. While the amplitude of the rate excursions can be 
reduced through a reduction of integration step size, it will be 
seen that regardless of step size, the rate oscillates about a non- 
zero mean due to the nonzero input applied force. A special 
computation is thus necessary to determine a step size suffi- 
ciently small to control both the mean and amplitude of rate 
error. In the 70-m antenna axis servos the ratio of friction to 
inertia is 102 millidegrees/sec 2 (1.77 milliradians/sec 2 ) for the 


azimuth axis and roughly 1.5 times that ratio for elevation. It 
can be shown that controlling the above rate errors to less than 
0.1 mdeg/sec requires a step size of roughly 1 .0 msec, which is 
unreasonably small and leads to excessive computation time 
and data storage for small computer-based simulations. 

II. Derivation of Equations for Modeling 
Friction 

From the foregoing discussion it is evident that accurate 
computer modeling of motion involving friction is based on 
knowledge of both the applied force and the velocity of the 
body influenced by the friction. The velocity determination 
is thus an essential adjunct of any friction model. When that 
velocity is determined by a finite step size integration process, 
the effects of friction reversals resulting from mid-integration- 
step zero crossings of velocity must be considered. The model- 
ing problem thus becomes the determination of net effective 
impulse such that the velocity change resulting from the finite 
step integration is reasonably accurate. 

The derivation of the equations for modeling friction is 
equally applicable to translational or rotational systems. For 
rotation, the derivation assumes a slowly varying externally 
applied torque to a constant inertia body in the presence of 
both static and invariant Coulomb friction and a known, 
fixed integration interval. The following conditions are con- 
sidered separately. 

(1) The applied torque is greater than the static friction 
torque and the inertia is initially at rest. 

(2) The applied torque and initial rate are such that the 
rate of motion will not reach zero within the next inte- 
gration interval. 

(3) The applied torque is less than the static friction and 
the initial rate of motion is such that the rate will reach 
zero within the integration interval. 

(4) The applied torque is greater than the static friction 
and the initial rate of motion is such that the rate will 
reach zero within the integration interval. 

For Condition (1) above, the net torque acting on the inertia is 
simply the applied torque diminished in magnitude by the 
Coulomb friction torque. The effective friction torque, 7^, is a 
constant with direction opposite to the applied torque 

T f = -F- sign(r p ) (1) 

for 6 = 0 and \T ap \ > F c where F c is the Coulomb friction tor- 
que, the sign function is unit amplitude with the algebraic sign 
of its argument, T ap is the applied torque, and 6 is the rate of 
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motion. The use of the Coulomb rather than the static value in In Condition (4) above, the applied torque is sufficient 

this case is based on the assumption of an instantaneous transi- to overcome the static friction level and reverse the rate within 

tion from the static to the sliding friction case. It will be seen the integration interval. The actual friction torque in the 

that this assumption results in a minimum net torque equal to physical system will thus reverse coincidental with the rate 

the difference between the static and Coulomb values. The reversal. The effective friction torque is obtained by averaging 

resulting rate impulse can be adjusted to better comply with the instantaneous friction torque over the integration interval, 

known physical behavior by selection of the integration step Thus 

size. 

The necessary condition for (2) above is determined from 
the equation of motion in the presence of friction 

( 6 ) 


T f = -F c • sign(0) 


[f -(t.-t )] 

L OC v l oc /J 



T f = F c • sign(0) 



where for rotational motion, J is the inertia moment, 6{t) is 
the rate at time t , T ap is the applied torque and Tp the fric- 
tion torque. Substituting -F c • sign (0) for the friction, Tp 
solving for Q(t oc ) = 0, and dividing by the integration step 
size, r-, yields 




0 ) 


Negative values of t Q Jt i imply a level of applied torque in 
excess of the friction and in the same direction as the rate. 
Positive values imply an applied torque either in a direction 
opposite the rate, or having a magnitude less than the fric- 
tion, or both. A negative or unity or greater than unity value 
of t oc \t i is a necessary and sufficient condition for Condition 
(2) above. The net torque in this case is the algebraic differ- 
ence between the applied and friction torques where the fric- 
tion is opposite in direction to the rate. 


T f = -F • sign (0) (4) 


for t oJ t i < 0 or t oc/ t i > !• 


In Condition (3) above, the rate will reach zero at some 
time within the integration interval and the applied torque 
will be insufficient to produce a rate in the reversed direction. 
Because the net torque acts on the inertia for the full interval, 
it must then decelerate the inertia to precisely zero rate at the 
end of the interval. The required net torque and necessary con- 
ditions are thus 



for \T ap \ >F S and 0 < t Q Jt i < 1 . 

If the initial rate is zero and Condition (1) above is not 
satisfied, the friction equals the applied torque and the net 
torque becomes zero. Further, since Conditions (1) through 
(4) encompass all possible torque and rate conditions of 
interest, Eqs. (1) through (6) together with their conditions of 
applicability form the basis for defining effective friction 
torque and the net torque. 


III. Application to Practice 

A function block incorporating the logic and equalities of 
Eqs. (1) through (6) was developed for incorporation into a 
dynamic simulation model of the 70-m azimuth axis servo 
using MATRIXx, a copyrighted software program from Inte- 
grated Systems, Inc. for simulation of dynamic systems. The 
friction model utilizes four general equation building blocks 
and one standard function block from the MATRIXx utilities. 
Because the simulation program does not facilitate conditional 
branching in function blocks, it was necessary to structure the 
algorithm to employ eight logical variables whose one/zero 
values define Conditions (1) through (4) discussed in relation 
to Eqs. (1) through (6). The logical variables are then used in 
one equation for the net torque. 

The algorithm inputs are Ul, applied torque, t/2, output 
rate from an adjoint integration process, and 1/3, a unit vari- 
able with the algebraic sign of U2. The output is T22, the 
net torque to the integrator. Parameters are the static and 
Coulomb friction levels, F s and F c , the inertia moment, /, 
and the integration interval t { . 

The computations are grouped in function blocks as shown 
in Fig. 2 to avoid intermixing relational and arithmetic opera- 
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tors. The logical assignments use the convention where the 
lefthand variable is true (one) if the righthand condition is 
satisified, and false (zero) otherwise. A listing of variables and 
equations is provided below. 

Friction logical variables 

YY\ = t/l>F 

c 

772 = Ul<-F 

C 

YY3 = U2>0 
YY4 = U2 < 0 

YY5 = U\ >F s and not (773 or 774) 

YY6 = UK -F and not (773 or YY4 ) 

YY1 = 771 or YY2 

Numeric variables, YN 1 = critical torque, YN2 = t oc lt i 

77V1 = 02-7- 
‘1 

- YN 1 
rjV2 U3F C - U\ 

Logical variable 

7710 = YN2 > 0 and 77/2 < 1 
Algebraic friction equation 
72 1 = (2 • YN2-\)F c 

Y22 = [773 (£71 - 721) + 774 (C/1 + 721)] 777 • 7710 
+ [ 773 (C/1 -F c )+YY4(U\ +F C )] (1 - 7710) 

+ 775 (C/1 - F f ) + 776 ( Ul + F s ) 

-(1 - 777)7710 • 7jV1 


IV. Simulation Test Results 

Performance of the conventional friction model of Fig. 1 
and the new model of Fig. 2 was compared in a dynamic simu- 
lation of the 70-m azimuth axis position servo. To simplify the 
simulation results, the flexible dynamics of the antenna struc- 
ture were replaced with equivalent rigid-body parameters, 
thereby reducing the dynamic system to eighth order. The sim- 
ulations are otherwise representative of actual system perform- 
ance. The system excitation was a small (1.0 millidegree) posi- 
tion step transient. The control torque, measured in units of 
psi of hydraulic differential pressure, and axis rate in milli- 
degrees/sec were recorded for comparison. The simulation was 
run for a total time of 5.0 sec with a 10 msec integration step 
size for both friction models. Results for the conventional 
model are shown in Fig. 3 and for the new model in Fig. 4. 

The position loop dynamics simulated are such that the 
control torque changes slowly in response to the small tran- 
sient applied here. The oscillatory behavior of the conventional 
friction model is evident during the intervals when the applied 
torque is less than the friction. The nonzero mean rate during 
these intervals is erroneous as the rate should be zero until the 
applied torque exceeds the 400 psi friction threshold. The 
irregularities on the rising and descending portions of the tor- 
que graph appear to be the spurious result of the oscillations 
coupling back through the rate loop. 

The new friction model produces smooth torque transitions 
and zero rate in the intervals between the static friction levels 
(425 psi) in conformance with expectations based on the 
physical laws of friction. The ripple in the rate result is most 
likely the 7.0-Hz mode of the gear actuator stiffness included 
in the model. 


V. Summary and Conclusions 

An improved method for modeling dynamic motion in the 
presence of friction has been described. Simulation test results 
demonstrated that the anomalies of more conventional meth- 
ods are corrected without increasing computer processing 
time. While the new algorithm is based on an external Euler 
integration, it should be capable of extension to incorporate a 
trapezoid- or possibly a polynomial-based integration method. 
The increased complexity of the predictive calculation with a 
polynomial integration may, however, negate any advantage 
to be gained with the more efficient integration methods. 
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Fig. 1. Conventional friction model for single coordinate motion. 
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Fig. 2. Simulation function block implementation of the new algorithm. 
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Fig. 4. New friction simulation algorithm: (a) applied torque and (b) output rate. 
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This report presents the computational results of a comparison of maser materials for 
a 32-GHz maser amplifier. The search for a better maser material is prompted by the rela- 
tively large amount of pump power required to sustain a population inversion in ruby at 
frequencies on the order of 30 GHz and above. The general requirements of a maser 
material and the specific problems with ruby are outlined. The spin Hamiltonian is used 
to calculate energy levels and transition probabilities for ruby and twelve other materials . 
A table is compiled of several attractive operating points for each of the materials ana- 
lyzed. All the materials analyzed possess operating points that could be superior to ruby . 
To complete the evaluation of the materials , measurements of inversion ratio and pump 
power requirements must be made in the future. 


I. Introduction 

This report describes the results of a theoretical evaluation 
of several paramagnetic materials being considered for use in a 
32-GHz maser amplifier. Previously, ruby has been very suc- 
cessfully employed in 2.3- and 8.4-GHz masers [1] , and in an 
18- to 26-GHz tunable maser [2] . However, due to a mono- 
tonically decreasing inversion ratio above 12 GHz for ruby (for 
push-pull pumping) ruby becomes less favorable at higher fre- 
quencies. The inversion ratio is defined as the ratio of the 
inverted-spin population difference with the pump on to the 
thermal-equilibrium population difference with the pump off, 
and is determined experimentally by the ratio of gain (dB) 
with the pump on, to absorption (dB) with the pump off. 
Moore and Neff [3] and Shell (private communication) mea- 
sured an inversion ratio of 1.1 at 32 GHz. In a recently com- 
pleted reflected-wave maser [5] , the inversion ratio was esti- 
mated to be 0.7 to 0.8. This low inversion ratio was a contrib- 


uting factor to the reduced gain-bandwidth of the maser com- 
pared to a similar 22-GHz maser [31] in which the inversion 
ratio was at least 1.6. Theoretical calculations of spin-lattice 
relaxation rates [29] suggest that the low inversion ratio 
(/ « 1) is inherent in ruby at this operating point (‘operat- 
ing point’ refers to a given dc field strength, crystal orienta- 
tion, and pumping scheme). Hence, other paramagnetic 
materials, as well as other operating points of ruby, are inves- 
tigated as a first step toward finding the best maser material 
at 32 GHz. 

Before proceeding with the materials evaluation, two points 
should be made. First, the inversion ratio in the 32-GHz re- 
flected-wave maser (RWM) could probably be improved by 
either using more pump power or by using a maser structure 
with a higher 0 at the pump frequency. In the former case, 
heating of the maser due to microwave losses will degrade the 
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gain and noise temperature performance, but to what extent 
is unknown. In the latter case, the resonant structure would 
significantly reduce the tunability of the RWM, but would 
improve the pump power coupling over a still useful instanta- 
neous bandwidth. However, even with these engineering 
improvements, the inversion ratio would still only approach 
unity. 

The second point is that the inversion ratio, even though 
central to the evaluation of a maser material, cannot be 
accurately addressed by the methods presented here. It is 
believed that the low inversion ratio of ruby at the 32-GHz 
RWM operating point is a result of an unfavorable set of spin- 
lattice relaxation rates. These relaxation rates and the corre- 
sponding inversion ratios have been calculated, but, due to the 
complexity of the calculations, will be presented in a future 
report. 

In Section II the requirements of a maser material are dis- 
cussed, including the difficulties with ruby. A list is provided 
of the materials evaluated. In Section III the use of the spin 
Hamiltonian to calculate energy levels and transition proba- 
bilities is outlined. In addition, limits on the inversion ratio 
and a material figure-of-merit are discussed and the method of 
computation is reviewed. In Section IV a table of several 
promising operating points is presented for each of the mate- 
rials analyzed. The conclusions are presented in Section V. 
The Appendix contains a table of measured relaxation times 
obtained from the literature for several materials of interest. 


II. Maser Material Considerations 

The choice of a maser material is the single most important 
factor in maser design. The maser material consists of a non- 
magnetic crystalline lattice lightly doped (0.01-0.1 percent) 
with paramagnetic ions. Detailed discussions of suitable para- 
magnetic ions and host crystals are given elsewhere [6] , [7] . 
Here, only an outline of the necessary and desirable material 
properties is given. 

The most common paramagnetic ions are transition metals 
and rare earths, because of their unfilled 3 d and 4/ electron 
shells. To operate in CW mode, the ion should possess an orbi- 
tal ground state with three or more spin levels. This eliminates 
most of the rare earths. The additional requirement of a negli- 
gible nuclear magnetic moment (a source of inhomogenous 
broadening) reduces the possible ions to Cr 3+ , Fe 3+ , Ni 2+ , and 
Gd 3 *. (Actually, it is not clear that such broadening would 
adversely affect a maser with an inhomogenous applied field.) 
Only Cr and Fe are considered here, as they are by far the 
most common choices of active ions. The electronic configura- 
tions of Cr 3+ and Fe 3+ are 3d 3 and 3 d 5 , respectively. The cor- 


responding free-ion ground states are 4 F 3/2 and 6 S 3/2 . Hence, 
Cr 3+ has four spin levels and Fe 3+ has six. 

The host crystal must be non-magnetic, non-metallic, and 
available in large single crystals of a high degree of perfection. 
The material should have a sufficiently high thermal conduc- 
tivity and small loss tangent at liquid helium temperatures and 
microwave frequencies to minimize heating of the lattice. To 
facilitate the microwave engineering, the crystal should possess 
a relatively isotropic and temperature-independent dielectric 
constant. Finally, the material should be machinable and 
chemically stable, and be able to withstand thermal cycling 
between liquid helium and room temperatures. 

The active ion substitutes for one of the metal ions in the 
crystal lattice. The local crystal electric field seen by the 
ion splits the highly degenerate orbital ground state into de- 
generate pairs (assuming the number of spin levels is even). 
This splitting of spin levels due to the crystalline electric field 
is termed the zero-field splitting (ZFS). The ZFS must be large 
enough to permit pump-induced transitions between non- 
adjacent spin levels. As a rule-of-thumb, the ZFS should be of 
the same order of magnitude as the signal frequency. 

Another very important material parameter is the spin- 
lattice relaxation time, which describes how long spins remain 
in an excited state before returning to thermal equilibrium 
with the lattice. At microwave frequencies, spin-lattice inter- 
action is the dominant spin relaxation mechanism. For rela- 
tively low concentrations of paramagnetic ions (<0.05 percent) 
and at liquid helium temperatures, the most significant spin- 
lattice interaction is thought to be the Kronig-Van Vleck 
mechanism [8] , in which lattice vibrations induce transitions 
between spin states, and spin-spin interactions are neglected. 
The spin-lattice relaxation times of the various transitions 
must be long enough (> msec) to permit saturation of the 
pumped levels with a reasonable amount of pump power. 
Impurities, ion clustering, and dislocations (all of which are a 
function of the crystal growth procedure) can shorten relaxa- 
tion times, so pure, defect-free crystals are preferable. 

Another material parameter is the (unbroadened) linewidth 
Af L of the material. At liquid helium temperatures, Af L is 
determined primarily by spin-spin interactions and is inversely 
proportional to the spin-spin relaxation time, which is the 
average length of time between random dephasing “collisions” 
of neighboring spins [9] . The linewidth is usually within the 
range of 10-100 MHz for solid-state maser materials. For linear 
stagger-tuned masers of bandwidth » Af L , Af L can be shown 
to have no first-order influence on the gain-bandwidth proper- 
ties of the maser (see Section III). However, if the taper is 
along the length of the material, a material with a smaller Af L 
may exhibit a larger noise temperature at one end of its band- 


59 



pass [9] . At present, it is not understood how Af L impacts 
pump power requirements. 

Finally, many materials possess two or more magnetically 
inequivalent sites (i.e., sites having different spectra) for the 
active ions to occupy, thus decreasing the density of useful 
ions. (Gain in dB is proportional to spin density.) In most 
materials, certain orientations exist for which these sites be- 
come equivalent. For such materials, only these orientations 
will be analyzed. 

As mentioned in Section I, difficulties were experienced 
using ruby at 32 GHz. A relatively low inversion ratio was ob- 
tained for the orientation employed in the 32-GHz RWM. The 
pump transitions in ruby at this orientation are quite weak, 
making it difficult to saturate the levels. This is a result of the 
small zero-field splitting (ZFS) of ruby: ZFS = 1 1 .4 GHz, which 
is only about one-third of the signal frequency. To obtain 
sufficient separation between spin levels for amplification at 
32 GHz, a relatively large magnetic field (11.8 kG) must be 
applied. This field becomes the dominant influence on the 
spins, far exceeding the effects of the local crystal field. Under 
such conditions, the spins in the lattice assume nearly pure- 
spin characteristics, as if the Cr ions existed freely in the 
magnetic field. The selection rules of quantum mechanics 
allow transitions only between adjacent pure-spin states [10] , 
thus leading to a small stimulated transition probability for 
the pump transitions in ruby. A material with a larger ZFS will 
in general have stronger pump transitions and will therefore 
better absorb the pump power, all other factors remaining 
the same. 

The materials analyzed in this work are listed below. Details 
of the crystal structure, orientation of magnetic axes, spin 
Hamiltonian, and site equivalence are given in the references. 


Ruby (Al 2 0 3 :Cr) 

[11] 

Emerald (Be 3 Al 2 Si 6 0 18 :Cr) 

[12] 

Spinel:Cr (MgAl 2 0 4 :Cr) 

[13] 

YAG:Cr (Y 2 Al s 0 12 :Cr) 

[14] 

YGG:Cr (Y 2 Ga 5 0 12 :Cr) 

[14] 

RutilerCr (Ti0 2 :Cr) 

[15] 

Zinc Tungstate :Cr (ZnW0 4 :Cr) 

[16] 

Andalusite:Cr (Al 2 SiO s :Cr) 

[17] 

Yttrium Oxide :Cr (Y 2 0 3 :Cr) 

[18] 

Rutile.Fe (Ti0 2 :Fe) 

[19] 

Zinc Tungstate:Fe (ZnW0 4 :Fe) 

[20] 


Andalusite:Fe (Al 2 Si0 5 :Fe) [21] 

Sapphire:Fe (A1 2 0 3 :Fe) [22] 

III. Evaluation of the Spin Hamiltonian and 
Associated Parameters 

The spin Hamiltonian H s describes the interaction of the 
electron spin of the paramagnetic ion with the local crystal 
field and with the applied magnetic field. Evaluation of H s 
allows calculation of the energy levels and transition probabili- 
ties for the spin system of a singlet orbital ground state ion in 
a radiation field. Detailed discussions of the spin Hamiltonians 
may be found in the literature [8] , [23] . Berwin [24] gives a 
detailed derivation of the spin Hamiltonian based on the for- 
mulation of Bleaney and Stevens [23] for ruby. In [25] , 
Berwin discusses the spin Hamiltonians for several of the 
materials listed in Section II. 

For most materials of interest, the assumption of an “inter- 
mediate” crystal field is made, meaning that the interaction 
energy of the crystal field with the ion falls between the 
Coulomb and spin-orbit interaction terms. In deriving H s , the 
spin-orbit and Zeeman terms are treated together as a pertur- 
bation on the singlet orbital ground state. The absence of 
orbital degeneracy is sufficient for quenching of the orbital 
angular momentum L [8] . That is, to first-order,! is equal to 
zero, so the ion behaves in a pure-spin-like manner. Up to a 
second-order perturbation, the spin -orbit coupling admixes the 
singlet ground state with higher-lying orbitals, restoring some 
of the orbital angular momentum. This second-order effect is 
the source of the ZFS. 

For ruby, the spin Hamiltonian has the form [11] 

H s = gpB-S + DSl ( 1 ) 

where the spectroscopic splitting-factor is g ^ 2 (g = 2.0023 
for pure spin), 0 is the Bohr magneton, B is the applied dc 
magnetic field, and 

S = xS x +yS y + zS t (2) 

is a vector of spin operators. The spin operators describe the 
observable properties of the paramagnetic ion spin states and 
may be conveniently written in matrix form [10] . The Carte- 
sian directions are along the principal axes of the magnetic 
complexes. These axes, usually expressed in terms of the crys- 
tallographic axes, are used to describe the symmetry^ of the 
magnetic resonance spectrum. The orientation of B is ex- 
pressed by the usual azimuthal and polar angles, 0 and 6. The 
constant D determines the ZFS and reflects, in principle, the 
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extent of admixing with higher lying orbitals. This spin Hamil- 
tonian exhibits axial symmetry about the magnetic z-axis and 
has a ZFS = 2\D\. 

For magnetic complexes of lower symmetry, additional 
spin operator terms may be needed to accurately specify the 
resonance spectrum; e.g., - S 2 y ) and (S* + S'* + sf ), the 

so-called orthorhombic and cubic terms [26] . The form of the 
required spin operator terms can sometimes be determined 
from crystal field theory through the use of equivalent oper- 
ators, as discussed in [24] . However, the crystal field approach 
usually assumes ionic bonding and neglects covalency effects. 
The fact that the coefficients of H s , and quite often the form 
of H s , must be determined experimentally (by fitting to EPR 
data) is quite likely a result of this assumption. 

Given an H s , one can solve for the spin energy levels and 
eigenstates by solving: 


The vector components are defined by the relation: 


a.. = 4-(a. . x + B. .y + 7. . z) 
u 2 v */ *r } 


( 7 ) 


where a, /3, and 7 are in general complex numbers. Noting 
Eq. (4), a, 0, and 7 are easily evaluated for any number of spin 
levels; in [24] the results for a 4-level spin system are given. 


As a means of comparing the relative strength of transitions, 
Siegman [9] has defined the quantity 


H* • 00 * • 


a - 


i// 1 1 2 


( 8 ) 


The stimulated transition probability can then be written as: 

w /,- = b 2g(f) 2 (9) 


H s | * ( > = E. | V/,.> / = 1,2, ... , 25+ 1 

( 3 ) 

where | \jj.) is usually written as a linear combination of pure- 
spin states, 


ItfO = a.\S) + fc,|S-l>+ ... +r.|-£> (4) 

and where E t is the energy of the ith level, and S is the spin of 
the ion. The labeling of the pure-spin states is identical to the 
labeling of states for a large applied field. Equation (3) is most 
easily solved by expressing it in matrix form and using the 
usual matrix methods to solve for the eigenvalues and eigen- 
vectors of H s . Note that H s is Hermitian. 

Knowing the eigenstates, one may then calculate the rate of 
stimulated transitions due to an RF magnetic field H x . Apply- 
ing Fermi’s golden rule, the probability of a transition between 
states / and j is [9] : 

w , 7 =±y 2 g(J)\(t i \g&S-H 1 \t / >\ 2 ( 5 ) 


The absolute maximum value of a 2 is obtained by using RF 
fields that are polarized such that H 1 is paralled to o. This 
value of a 2 is given by the trace of a a* [9] , which has the 
value: 


(o 2 )m BX = j(I<*I 2 + I0I 2 + IyI 2 ) (10) 


This expression will be used to compare transition strengths. 
This formulation does not consider whether or not the pre- 
scribed polarization of H x is achievable in a given microwave 
circuit. 


The gain of the maser is obviously of central importance 
and involves several important material parameters. The gain 
in dB of an unbroadened RWM or TWM [9] is 


G 


dB 



( 11 ) 


where s is the slowing factor, 2 is the maser length in free-space 
wavelengths, and Q m is the magnetic Q of the maser material. 
The magnetic Q is defined as 


where 7 = gfaj ft, and g(f) is the line shape (as a function of 
frequency) for the transition. The term sandwiched in the 
matrix element is the magnetic dipole interaction energy. 
Since non-spin operators may be removed from the matrix 
element in Eq. (5), it is convenient to define the vector 
quantity 

Ojj = IS I ( 6 ) 


_ energy stored in material 

^ m energy emitted per cycle by material 

( 12 ) 

Assuming « 1 (|f * 1/3 for /= 32 GHz, T= 4.2 K), the 
reciprocal of Q m may be expressed as [9] 

1 7 2 h hf Io 2 r) N 

Qm Wo kT no - of l eve l s ’ 
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where / is the inversion ratio, r? is the filling factor, and N is 
the spin density. The filling factor accounts for the fraction of 
RF field in the material and the degree to which the field is 
optimally polarized; its value varies between 0 and 1 . The spin 
density is determined by the concentration of paramagnetic 
ions; for 0.05 percent Cr concentration in ruby , jV= 2.35 X 1 0 19 
spins/cm 3 . From Eqs. (11) and (13), we may express G dB in 
terms of material parameters as: 


G 


dB 


Io 2 N 

A f L • no. of levels 


The inversion ratio is defined as 


(14) 


I = 


An 




AN.. 

) i 


05 ) 


where A Nj / = Nj - N i is the thermal-equilibrium population 
density-difference, and An (J is the population-density differ- 
ence under pumped conditions. To determine / for the signal 
transition of a multi-level spin system, one must solve a set of 
rate equations that accounts for both stimulated transitions 
and spin relaxation [9] . These rate equations may be simpli- 
fied by assuming steady-state conditions and saturated pump 
levels, and by neglecting the influence of the incoming signal. 
Since the relaxation rates are not known (this issue will be 
addressed in the aforementioned future report), the rate 
equations cannot be solved for the actual inversion ratio. 
Instead, assumptions are made about the relaxation rates and 
the corresponding / is determined. In one case, all relaxation 
rates are assumed to be equal and the inversion ratio is defined 
as l equaV An upper limit can be put on I by assuming an 
optimum set of relaxation rates exist [9] . This is defined as 
I opt . Note that the actual inversion ratio may be less than 
both I op t and I equai , but for pumping schemes employing two 
pumps, the actual inversion ratio often lies between I opt and 
I equaV Expressions for I opt and I equal are easily derived from 
the rate equations but will not be given here. 

A maser material “figure-of-merit,” indicating the gain- 
bandwidth potential of the material, was proposed [27] to be 
AfJQm • From Eq. (13), and considering only material param- 
eters, we find that 


A 4 a io 2 N 
O no. of levels 

m 


( 16 ) 


The computer analysis was performed on the JPL UNIVAC 
(F-system) with a modified version of an existing Fortran code 
[25] originally written for the analysis of ruby. The JPL 
Fortran subprogram HERMQR 1 was used to compute the 
eigenvalues and eigenvectors of H s . The existing code was 
modified to include the spin Hamiltonians of the other mate- 
rials. Other small modifications were also made. 

The inputs to the program are as follows: 

(1) material 

(2) range of 6 and 0-increment 

(3) a single value of 0 

(4) range of B and 5-increment 

(5) signal frequency window f L 0 , f HJ 

(6) minimum allowed value for max (a, 0, 7) for signal 
transition, a min 

The code is run for a given material at a given 0. Typical values 
are 6 = 0 to 90 deg, Ad - 10 deg, 0 deg < 0 < 90 deg, and 
B = 0 to 15 kG, AB = 0.5 kG. For a 32-GHz signal frequency, 
the window was usually f LO = 31 GHz and f HI = 33 GHz. For 
transitions falling within this range, o min = 1.0 was chosen. 
The signal frequency window and o min are used to pre-select 
operating points, thus decreasing computer output. 

The program outputs are the following: 

(1) energy levels and eigenstates 

(2) transition probabilities (both a, 0, 7 and a 2 ) 

(3) local values for— ~ and 

V ' AS A f s 

(4) 1 op V 1 equal* and figure-of-merit 

Because of the large number of possible pumping schemes 
(especially for the 6-level systems), the following guidelines 
were employed in choosing schemes: 

(1) signal transition is between adjacent levels 

(2) use two pumps, when possible 

(3) pumps can skip one level at most 

These guidelines limit the pumping schemes to the usual ones 
employed. Many other schemes are possible [9] . 


Hence, the figure-of-merit optimizes the product G dB • A f L . 
For the case of a linear stagger-tuned maser with bandwidth 
Af A f L , one can show that the quantity on the right-hand 

side of Eq. (16) optimizes the product G dB • Af. 


l JPL Fortran V Subprogram Directory , Fifth Edition, JPL Publication 
D-829 (internal document), Jet Propulsion Laboratory, Pasadena, 
California, July 1982. 
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IV. Results and Discussion 

Table 1 compares several of the more promising operating 
points of each of the materials analyzed, starting with ruby. 
The materials are arranged in order of increasing ZFS; for 
materials with S = 5/2, the ZFS of the lower degenerate states 
is used. Across the top of the table is the material name and 
operating point number. The first row of the table shows the 
paramagnetic ion used. The second row gives the ZFS. Note 
that all the materials have a ZFS larger than ruby. The third 
row indicates the number of magnetically non-equivalent ionic 
sites in the lattice; a “1” means that all sites are equivalent. 
The fourth row gives the orientation of B in terms of the polar 
and azimuthal angles, 8 and 0, measured with respect to the 
axes of the magnetic complex. If a value for 0 is not given, 
then the Hamiltonian is axially symmetric. For materials with 
non-equivalent sites, 8 and 0 are restricted to values for which 
the sites are equivalent. The fifth row of the table gives the 
magnitude of B. The fields do not exceed 14 kG for the oper- 
ating points shown. 

Rows 6,7, and 8 list the signal and pump frequencies with 
the corresponding transition levels shown in parentheses. The 
signal frequency is always 32.0 GHz. The pump frequencies 
vary roughly between 50 and 90 GHz. Larger pump frequencies 
will in general yield larger values of I opt and l equaV On the 
other hand, copper and dielectric losses increase at higher fre- 
quencies, with the result that heating of the maser structure 
may restrict the use of high pump power levels at high fre- 
quencies for some materials. Note also that pump frequencies 
within the same waveguide band simplify engineering issues. 

Row 9 gives the value of AfJAB evaluated near the operat- 
ing point. For a maser tunable over a wide range, AfJAB 
should be of the same sign and of similar magnitude for the 
maser material and the isolator material. Rows 10 and 11 give 
values of Af p jAf s (actually, Af p /AB • AB/Af s ) for both 
pumps. This parameter is indicative of the pump bandwidth 
required for a given signal bandwidth, so it is preferable for 
| Af p /Af s | to be as small as possible. For most of the operating 
points Af p /Af s » 2, but several have values <1. Note that 
Af/AB is evaluated as a simple two-point difference, with the 
second point arbitrarily located 200 G from the operating 
point. For strongly curved energy levels, these values may not 
be accurate across the desired band. 

Rows 12, 13, and 14 give a 2 for the signal and pump transi- 
tions for optimum elliptieally polarized fields according to 
Eq. (10). Recall from Eqs. (14) and (16) that G dB and the 
figure-of-merit are proportional to a 2 , so as large a value of a 2 
as possible is desired. In general, a 2 is a factor of 2- to 3-times 
larger for the 6-level spin systems. Similarly, a large value of 
a 2 is preferred, since the pump power required for saturation 


is inversely proportional to a 2 . According to [29] , the pump 
power required for saturation will satisfy 


P cc 

pump 


L • A/„ 


a 2 t 
P P 


(17) 


where r p is the effective pump relaxation time;r p is not iden- 
tical to the measured pump relaxation time. The values of a 2 
in Table 1 span nearly two orders of magnitude. 


Rows 15 and 16 show the inversion ratios for equal and 
optimum relaxation times. The values of I equai and I opt are 
similar for the various operating points, except when only one 
pump is employed. 

Finally, row 17 gives the material^ figure-of-merit, com- 
puted in units of MHz, as 


A^ = 5.6/ op ,-q, 2 

Q no. of levels 

m 


(18) 


This follows from Eq. (13) evaluated at/= 32 GHz, T= 4.2 K, 
7] = 0.5, and A" = 2.35 X 10 19 spins/cm 3 . Since the true inver- 
sion ratio for a given operating point may be as much as a 
factor of 3 or more smaller than I opv a detailed comparison 
of figure-of-merits could be misleading. 


Table 1 is by no means complete in the sense that one may 
confidently select the best maser material from it. Two very 
important parameters are missing: the actual inversion ratio 
and the pump power, P pump , required to maintain that inver- 
sion ratio. At present, both of these parameters must be 
measured. 


Since both / and P pump depend critically on relaxation 
times, a table of relaxation times, located in the Appendix and 
labeled Table A, was compiled from data found in the litera- 
ture. Ionic concentration, frequency, orientation, and transi- 
tion information is included. Because these parameters do not 
coincide with pump operating-points of interest to us, and 
because of the dependence of relaxation times on measure- 
ment technique and crystal growth procedures [30] , the data 
in Table A could easily be an order of magnitude or more 
different from what would be measured for the materials in 
Table 1 . Hence, the relaxation times in Table A are not used in 
any calculations in this work, even though they are the best 
values available to us at the present time. 

Before discussing the many materials in Table 1 , consider 
the operating point in which ruby is presently being used at 
32 GHz (first column of table, Ruby No. 1). The ruby is 
oriented at the double-pump angle (8 = 54.7 deg) and pumped 
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in the push-pull mode, so the pump frequencies are equal. 
Scanning down the column, two potential problems can be 
seen with this operating point. First, the pump bandwidths 
are nearly twice the signal bandwidths, so if 500 MHz of 
signal bandwidth is desired at 32 GHz, 1-GHz bandwidth must 
be pumped at 66 GHz. To pump such a large bandwidth, the 
pumps must be swept across the band, effectively reducing the 
pump power at a given frequency. How detrimental this is 
depends on the relaxation times of the pump transitions. 

The second problem with ruby at this operating point is 
the weak pump transitions: o 2 x - 0.05 and o 2 2 = 0.04 com- 
pared to o 2 = 1.92. For this reason, high levels of pump power 
are used in the 32-GHz RWM, although the pumped levels are 
still not saturated. Note that a small o 2 does not preclude 
good maser performance, as demonstrated by the 18- to 
26-GHz maser of Moore and Clauss [2], [31] for which 

°h = and ° 2 P 2 = 0-11- 

The most significant problem with ruby, the low inversion 
ratio, is not indicated by the table. Measured values of / for 
the case of saturated pump transitions have been approxi- 
mately 1.1 ([3], and J. Shell, private communication). (The 
similarity to I equaX = 1.1 does not necessarily mean that the 
relaxation times are equal.) In the 18- to 26-GHz range, mea- 
sured values of / have been in the range of 1.6 to 1.8 [2] , 

[3] > [31] * 

Finally, from Table A it can be seen that ruby has long 
relaxation times compared to the other materials. Hence, even 
though o 2 is small, the denominator of Eq. (17) remains large 
enough for ruby to require large but manageable pump power. 

Consider several other operating points in Table 1. Since 
ruby has worked so well in the past, ruby at another orienta- 
tion is an obvious candidate for a maser material. The second 
column of the table, Ruby No. 2, shows ruby at 9 = 90 deg 
and with a push-push pumping scheme. Even though a 2 and 
o p j are weaker and the values for I equal and I opt are less than 
for Ruby No. 1, if the actual inversion ratio is >1.5, Ruby 
No. 2 could yield a higher gain -bandwidth product. Some 
investigators [4] claim Ruby No. 2 to be supejior to Ruby 
No. 1 at millimeter wavelengths because of a higher inversion 
ratio and less critical orientation (less spreading of pump 
power due to c-axis wander). 

The sapphire host has many desirable properties, so Fe- 
doped sapphire is a logical choice. For Sapphire No. 1, o 2 is 

2- to 5-times stronger than that of Ruby No. 1 and o ] is 2- to 

3- times that of Ruby No. 1. However, according to Table A, 
the pump relaxation times may be an order of magnitude 
shorter, implying that Sapphire No. 1 could require several 
times the pump power of Ruby No. 1. Other investigators 


[28] suggest that the relaxation times of Fe-doped sapphire 
are similar to those of ruby. If this is true, then Sapphire No. 1 
could require several times less pump power than Ruby No. 1 . 
Measurements of the relaxation times and pump power re- 
quired must be made to determine which scenario is correct. 

Emerald has some similarity to ruby, having the same spin 
Hamiltonian and potentially long relaxation times. If the 
inversion ratio for Emerald No. 1 is >2, then this operating 
point would be very attractive. A problem with emerald is the 
difficulty of its growth, which may not allow the high degree 
of crystal perfection necessary. 

Zinc tungstate has a complicated H s , large ZFS values, and 
may have short relaxation times, making it quite different 
from ruby. Cr-doped zinc tungstate has several promising oper- 
ating points. In particular, ZnW0 4 No. 1 is attractive, assum- 
ing I ^ l opr Fe-doped zinc tungstate exhibits a large number 
of excellent operating points. For ZnW0 4 No. 3, a 2 is 2- to 
3-times greater than that of Ruby No. 1 and o 2 is 100 times 
that of Ruby No. 1. This large value of o 2 raises the question 
of whether it is preferable to have large a 2 and small r p 
(ZnW0 4 No. 3) or small o 2 and large r p (Ruby No. 1). Assum- 
ing the product o 2 * r p is constant, the pump power require- 
ments will be similar, but in the former case more energy 
would be transferred to the lattice. This could raise the tem- 
perature of the maser material, thereby decreasing the gain; 
however, this possibility has not been considered in detail. 
Harmonic cross relaxation may be a problem for several of the 
better operating points for ZnW0 4 :Fe. One similarity zinc 
tungstate has with ruby is that it can be grown by the Czoch- 
ralski method. 

The rutiles appear promising, but the large, anisotropic, 
temperature-dependent dielectric constant of rutile makes it 
unattractive from an engineering standpoint. 

One can easily see from Table 1 that many of the other 
materials analyzed may make excellent maser materials, but 
the lack of information on inversion ratios, pump power 
requirements, relaxation times, etc., makes them difficult to 
evaluate. 

Another possibility, not addressed in Table 1, is to use 
standard ruby doped with a fast-relaxing impurity. This addi- 
tional impurity may be added in the melt or created in the 
finished ruby by exposure to X-rays (so-called orange-ruby 
[9]). A properly chosen impurity can shorten certain relax- 
ation times, which, by making the times more optimal, can 
increase the inversion ratio. However, the impurity would not 
alter the ZFS, so the pump transitions would still be weak. 

We hope to eventually make measurements of inversion 
ratios, pump power requirements, and relaxation times at 
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32 GHz and around 60 GHz on several of the materials in 
Table 1. 

Other materials we would like to analyze but for which 
we do not have the spin Hamiltonians are spinekFe and 
chrysoberyhCr, Fe. 

V. Conclusions and Future Work 

Any of the materials analyzed in this work may yield better 
maser performance than does ruby at 32 GHz at the double- 
pump angle. However, several key parameters related to pump 
power requirements may eliminate some or all of these mate- 
rials. Based on results from the analysis of the spin Hamil- 
tonians and on scanty (and unreliable) relaxation time data, 
several materials show particular promise (e.g., Fe-doped zinc 
tungstate). 


To complete the materials evaluation, it will be necessary to 
measure the inversion ratio and pump power required for 
saturation for each operating-point of interest. Barring cross- 
relaxation and other concentration-dependent effects, know- 
ledge of the relaxation-times would be sufficient to calculate 
both / and P pump . However, the subtlety of measuring 
relaxation-times will most likely require that / and P pump be 
measured. 


Better understanding of the low inversion ratio of ruby is 
needed. By accounting for the spin-phonon interaction, one 
can calculate the relaxation rates of the transitions for low 
spin concentrations [29] . With these relaxation rates, the 
inversion ratios and pump power requirements can be calcu- 
lated for each operating-point of interest and for various 
physical temperatures. 
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Table 1. Promising 32-GHz operating points for ruby and other materials. The materials are 

arranged in order of increasing ZFS. 





Material 



Operating 













Point 

Ruby 

Ruby 

Ruby 

Sapphire 

Sapphire 

YAG 


No. 1 

No. 2 

No. 3 

No. 1 

No. 2 

No. 1 

Ion 

Cr 

Cr 

Cr 

Fe 

Fe 

Cr 

ZFS, GHz 

11.4 

11.4 

11.4 

12.1, 19,1 

12.1, 19.1 

15.7 

No. of ionic sites 

1 

1 

1 

2 

2 

1 

0,0, deg 

54.74 

90 

90 

90,45 

60,30 

54.74 

B, kG 

11.81 

13.50 

11.20 

9.50 

12.44 

13.27 

4.GHz 

32.0 (32) 

32.0(21) 

32.0 (32) 

32.0 (32) 

32.0 (54) 

32.0 (32) 

fp\ 

66.2 (13) 

70.3(13) 

57.6 (13) 

68.6 (13) 

77.3(13) 

76.2(13) 

fp2 

66.2 (24) 

43.3 (34) 

68.9 (24) 

59.9 (24) 

66.1 (35) 

76.2 (24) 

AfjAB, MHz/G 

2.9 

2.8 

2.7 

2.8 

3.0 

2.9 

Afpi/Af, 

1.9 

2.0 

2.0 

2.0 

1.8 

1.8 


1.9 

1.0 

2.0 

2.0 

2.0 

1.8 


1.92 

1.51 

1.97 

6.87 

5.64 

1.62 

°l l 

0.05 

0.02 

0.03 

0.08 

0.09 

0.13 

°P2 

0.04 

1.51 

0.02 

0.17 

0.28 

0.31 

I opt 

3.1 

2.2 

2.8 

3.0 

4.8 

3.7 

1 equal 

1.1 

0.7 

0.9 

1.0 

2.0 

1.4 

A4/fi m ,MHz 

8.3 

4.7 

7.7 

19.2 

25.3 

8.4 




Material 



Operating 













Point 

YAG 

YGG 

YGG 

Spinel 

Andalusite 

Andalusite 


No. 2 

No. 1 

No. 2 

No. 1 

No. 1 

No. 2 

Ion 

Cr 

Cr 

Cr 

Cr 

Cr 

Cr 

ZFS, GHz 

15.7 

20.9 

20.9 

29.7 

32.0 

32.0 

No. of ionic sites 

1 

1 

1 

4 

2 

2 

0,0, deg 

70 

54.74 

70 

54.74 

55,0 

70, 0 

B y kG 

13.40 

13.96 

13.35 

13.15 

12.94 

13.00 

fs' GHz 

32.0 (43) 

32.0 (32) 

32.0 (43) 

32.0 (32) 

32.0 (32) 

32.0 (43) 

fpl 

49.6 (12) 

81.8(13) 

53.3 (12) 

75.3 (13) 

74.2(13) 

47.0(12) 

fp 2 

69.7 (24) 

81.8 (24) 

70.7 (24) 

75.3 (24) 

75.8 (24) 

69.4 (24) 

Af s /AB, MHz/G 

2.4 

2.7 

2.2 

2.9 

2.8 

2.4 

A/p i /A/, 

1.2 

1.9 

1.3 

1.8 

1.9 

1.1 

Af p2 /Af s 

2.1 

1.9 

2.1 

1.8 

1.9 

2.1 

•? 

1.25 

1.50 

1.01 

1.66 

1.63 

1.27 

°|l 

1.52 

0.16 

1.54 

0.28 

0.15 

1.53 

°P2 

0.31 

0.47 

0.55 

0.13 

0.31 

0.31 

1 opt 

3.6 

4.0 

3.9 

3.6 

3.6 

3.5 

I equal 

1.5 

1.6 

1.6 

1.4 

1.4 

1.4 

Af L !Q m yUHz 

6.3 

8.4 

5.5 

8.4 

8.2 

6.2 
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Table 1 (contd) 


Operating 

Point 



Material 



Rutile 
No. 1 

Rutile 
No. 2 

Rutile 
No. 3 

Rutile 
No. 4 

ZnW0 4 
No. 1 

ZnW0 4 
No. 2 

Ion 

Cr 

Cr 

Fe 

Fe 

Cr 

Cr 

ZFS, GHz 

43.3 

43.3 

43.3,81.3 

43.3,81.3 

51.6 

51.6 

No. of ionic sites 

2 

2 

2 

2 

1 

1 

<9, 0 , deg 

45,0 

54.74,45 

52.55,40 

71.12,70 

40,90 

50,90 

B t kG 

12.78 

14.06 

9.35 

11.71 

9.78 

13.10 

GHz 

32.0 (43) 

32.0 (32) 

32.0 (32) 

32.0 (43) 

32.0(21) 

32.0 (32) 

fpl 

56.0(12) 

82.6 (13) 

78.9 (13) 

71.8 (12) 

54.1 (13) 

75.1 (13) 

fp2 

65.4 (24) 

82.6 (24) 

73.3 (24) 

81.4 (24) 

54.9 (34) 

88.4 (24) 

Af s /AB, MHz/G 

2.0 

2.6 

2.8 

3.7 

2.3 

1.9 

AApi/AT, 

1.5 

1.9 

2.1 

0.9 

1.1 

2.2 

&fp2^fs 

2.2 

1.9 

2.1 

2.1 

1.8 

2.6 

° 2 S 

1.57 

1.37 

3.80 

3.57 

1.68 

1.37 


1.18 

0.48 

0.67 

3.31 

0.53 

0.31 

°P2 

0.37 

0.30 

1.55 

1.01 

0.57 

0.61 

I opt 

3.7 

4.1 

3.7 

5.7 

2.1 

3.8 

I equal 

1.4 

1.6 

1.4 

2.6 

0.5 

1.5 

Af L,IQm’ MHz 

8.1 

7.9 

13.1 

19.0 

4.9 

7.3 




Material 



Operating 













Point 

Emerald 

Emerald 

ZnW0 4 

ZnW0 4 

Y 2°3 

Andalusite 


No. 1 

No. 2 

No. 3 

No. 4 

No. 1 

No. 3 

Ion 

Cr 

Cr 

Fe 

Fe 

Cr 

Fe 

ZFS, GHz 

53.5 

53.5 

61.0,76.9 

61.0,76.9 

72.7 

112.6,225.2 

No. of ionic sites 

1 

1 

1 

1 

4 

2 

6 , 0 , deg 

40,0 

54.74 

90,45 

90,45 

40 

30,0 

B, kG 

7.95 

14.04 

8.59 

5.28 

9.14 

6.63 

/..GHz 

32.0 (43) 

32.0 (32) 

32.0 (54) 

32.0 (54) 

32.0 (21) 

32.0 (21) 

fp 1 

48.4 (12) 

86.1 (13) 

66.0(13) 

67.9(13) 

69.1 (13) 

107.3 (13) 

fp2 

50.8 (24) 

86.1 (24) 

67.4 (35) 

57.6 (35) 

88.3 (24) 

- 

Af s /AB, MHz/G 

3.5 

2.2 

2.1 

-2.7 

-3.32 

4.78 


1.5 

2.2 

-0.1 

0.4 

-0.16 

-0.07 

Af p2 lAf s 

0.3 

2.2 

2.1 

-0.5 

-0.72 

- 

°S 

1.54 

1.38 

5.52 

3.24 

0.76 

3.55 

°P 1 

0.13 

0.66 

3.01 

3.17 

0.73 

2.32 

°h 

0.59 

0.19 

2.91 

3.51 

1.31 

- 

I opt 

2.6 

4.3 

4.3 

3.8 

3.6 

2.0 

I equal 

0.8 

1.7 

1.8 

1.5 

1.4 

0.5 

A4/e m ,MHz 

5.6 

8.3 

22.2 

11.5 

3.8 

6.6 
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Appendix 

Table of Relaxation Times 


This Appendix contains a table of relaxation times (7^) for 
various materials at 4.2 K. The paramagnetic ion concentra- 
tion, transition frequency, transition, and orientation are also 
shown. In addition, the literature references are included. Note 
that the table does not mention either the crystal growth 


process or the relaxation time measurement technique, both of 
which may significantly alter the reported relaxations times. 
Hence, these relaxation times could easily be an order of mag- 
nitude or more different from what one might measure, and 
should not be used in calculations unless verified. 


Table A-1 . Measured relaxation times, T r for various materials at physical temperature T = 4.2° K 


Material 

Ionic 

concentration, 
atomic percent 

Frequency, GHz 

Orientation, deg 
(9,0) 

Transition 

Tj, msec 

Reference 

Ruby 

0.03 

34.6 

90 

3-4 

21 

[32] 





2-3 

16 






1-2 

22 






2-4 

54 






1-3 

56 



0.013 

35 

90 

2-3 

15.5 

[33] 


0.052 



2-3 

17.5 


Sapphire: Fe 

0.03 

34.6 

90,0 

4-5 

1.8 

[34] 





3-4 

1.6 






2-3 

1.5 






1-2 

2.0 


Emerald 

4.9 X 10 19 ions/cm 3 

9.3 

0 

3-4 

9 

[35] 




90 

1-2 

8 






3-4 

11 


ZnW0 4 : Cr 

0.005-0.3 

9.2 

90, 90 

1-2 

*1.5 

[36] 

0.018-0.72 

33 

90,90 

1-2 

*0.5 



0.08 

X-band 

? 

5-6 

~0.3 

a 





1-2 

-1 


Rutile: Cr 

0.07 

34.6 

90, 0 b 

1-4 } SitCA 

4.5 

2.5 

[34] 





il } siteB 

2.5 

2.1 


Rutile: Fe 

0.01-0.02 

9.4 

0,0 

1-2 

*2 

[37] 





3-4 

*2 


a J. Orton, private communication with K. Standley and R. Vaughan. 
b For this orientation the ionic sites are inequivalent. 
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The cryogenic noise temperature performance of a two-stage and a three-stage 32-GHz 
HEMT amplifier has been evaluated . The amplifiers employ 0.25-pm conventional AlGaAsj 
GaAs HEMT devices , hybrid matching input and output microstrip circuits , and a cryo- 
genically stable dc biasing network. The noise temperature measurements were performed 
in the frequency range of 31 to 33 GHz over a physical temperature range of 300 K down 
to 12 K. Across the measurement band , the amplifiers displayed a broadband response , 
and the noise temperature was observed to decrease by a factor of 10 in cooling from 
300 K to 15 K. The lowest noise temperature measured for the two-stage amplifier at 
32 GHz was 35 K with an associated gain of 16.5 dB, while the three-stage amplifier 
measured 39 K with an associated gain of 26 dB. It was further observed that both ampli- 
fiers were insensitive to light. 


I. Introduction 

Traditionally, the extraordinarily sensitive receiver systems 
operated by the Jet Propulsion Laboratory’s Deep Space Net- 
work (DSN) have employed ruby masers as the low-noise 
front-end amplifiers. The rapid advances recently achieved by 
cryogenically cooled high-electron-mobility transistor (HEMT) 
low-noise amplifiers (LNAs) in the 1- to 10-GHz range are 
approaching maser amplifier performance [1] , [2] . In order to 
address its future spacecraft navigation, telemetry, radar, and 
radio science needs, the DSN is investigating both maser [3] 
and HEMT amplifiers for its 32-GHz downlink capability. 
This report describes the noise temperature performance of 
the 32-GHz HEMT LNAs. 


II. HEMT Device 

Since one of the primary functions of the LNA is to mini- 
mize the receiver system noise temperature, the characteriza- 
tion and selection of HEMT devices is critical to the LNA’s 
performance. The selection of the 0.25-/um gate length con- 
ventional AlGaAs/GaAs HEMTs was based on their previously 
demonstrated reliability and exceptionally high gain and low 
noise characteristics [4] , [5] . 

The devices were fabricated on selectively doped AlGaAs/ 
GaAs heterostructures grown by molecular beam epitaxy 
(MBE) with a Varian GEN II system on a 3-inch-diameter 
GaAs substrate. The details of the material growth conditions 
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are discussed elsewhere [6] . Figure 1 schematically illustrates 
the cross section of the HEMT device. The HEMT wafer 
exhibited a sheet carrier density of 8.1 X 10 12 /cm 2 with a 
mobility of more than 75,000 cm 2 /V*sec at 77 K. All levels 
were defined by electron beam lithography, and the T-shape 
gates were fabricated using the PMMA/P(MMA-MMA)/PMMA 
tri-layer resist technique [7] to achieve a low series gate 
resistance. 

For low-noise performance at cryogenic temperatures, the 
HEMT device must exhibit good pinch-off characteristics and 
high transconductance, g m . Good pinch-off characteristics are 
achieved by strong confinement of the charge carriers to the 
channel region with a sharp interface of high quality and a 
large conduction band discontinuity. An enhanced g m at the 
operating bias is obtained by a judicious choice of doping 
concentration and space layer thickness [8] . An A1 mole 
fraction of approximately 30% is required for a large conduc- 
tion band discontinuity, while the high g m is achieved with a 
4-nm spacer layer and a doping concentration of approxi- 
mately 2 X 10 18 dopant atoms/ cm 3 . Although these values 
will result in a high-performance room-temperature device, 
at physical temperatures below 150 K the device will suffer 
from I-V collapse [9] and exhibit the persistent photocon- 
ductivity effect associated with the presence of deep donor 
traps (called DX centers). In order to obtain excellent device 
performance at cryogenic temperatures and to eliminate light 
sensitivity, previous work [1] , [8] has demonstrated that the 
A1 composition must not exceed 23% and the doping concen- 
tration must be approximately 10 18 /cm 3 . 

The data shown in Table 1 , comparing two HEMTs with the 
same A1 mole fraction (23%) but different doping concentra- 
tions in the n-AlGaAs layer, serves to illustrate the difference 
between low-temperature and room-temperature device opti- 
mization. Device A has an n-AlGaAs doping concentration of 
10 18 /cm 3 , while that of B is twice as high. As expected, 
device B exhibited a higher g m and associated gain than device 
A, with approximately the same noise figure for both devices 
at 300 K. However, at 13 K and 8.5 GHz, device B exhibited 
a minimum noise temperature of 13.1 K, while device A 
yielded a value of 5.3 K. 

III. Amplifier Design and Circuit 

Both LNAs were designed to achieve the best room-temper- 
ature low-noise performance based on the measured room- 
temperature device parameters. Following construction and 
room-temperature optimization, the LNAs are then biased for 
lowest noise performance at cryogenic temperatures. 

The device gate width of 75 pm selected for this work was 
determined by the tradeoffs associated with optimum impe- 


dance matching, circuit bandwidth, intermodulation distor- 
tion, power handling capability, and power dissipation. Figure 2 
shows a photograph of the 32-GHz hybrid two-stage HEMT 
LNA package. The input and output ports utilize a broadband 
WR28-to-stepped ridge waveguide-to-microstrip transition. 
Figure 3 shows the insertion loss and return loss of a stepped 
ridge fixture that consists of two stepped-ridge transitions 
connected back-to-back with a microstrip 50-ohm line 0.5 in. 
long. The input and output matching networks were designed 
based on the device equivalent circuit values obtained from 
fitting measured S-parameters at the low-noise bias condition 
to the model from 2 to 20 GHz. Figure 4 shows the topology 
used for the 0.25-jum HEMT equivalent circuit model. Input, 
output, and interstage matching circuits were designed on 
10-mil quartz substrate with TaN thin-film resistors and 
TiWAu metallization. A schematic diagram of the two-stage 
hybrid HEMT LNA is shown in Fig. 5. The edge-coupled 
symmetric microstrip dc blocking transmission line also served 
as a bandpass filter, improving the out-of-band stability. As 
shown in Fig. 6, the three-stage LNA is constructed from the 
two-stage LNA by the insertion of another interstage match- 
ing circuit. 

The LNA fixture (OFHC copper) and dc bias circuits [10] 
are designed for operation at cryogenic temperatures. Diode 
protection was included in both the gate and drain bias cir- 
cuits. LEDs were mounted on the cover of the fixture above 
each of the HEMTs for the purpose of examining their light 
sensitivity at cryogenic temperatures. All of the stages use 
devices from the same wafer. 

IV. Measurement Results 

The LNAs were first measured at room temperature with 
the devices biased for lowest noise at room temperature and 
then biased for lowest noise performance at cryogenic tem- 
peratures. The LNA room-temperature broadband noise fig- 
ure and gain for the two- and three-stage LNAs are shown in 
Figs. 7 and 8, respectively. Both LNAs exhibited an average 
noise figure of approximately 2 dB from 28 to 36 GHz. From 
29 to 34 GHz, the gain measured approximately 17 dB and 
24 dB for the two-stage and three-stage LNA, respectively. 
The addition of an external isolator only slightly degraded the 
gain and noise figure by 0.3 dB. 

With the devices biased for lowest noise at cryogenic tem- 
perature (12 K), the noise temperature (referenced at the 
room-temperature input waveguide flange) of both LNAs was 
observed to decrease nearly quadratically as a function of 
physical temperature as they cooled from 300 K to 12 K. 
(See Fig. 9 for a diagram of the closed-cycle refrigerator and 
measurement system.) The noise temperature of the two- 
stage LNA decreased from 350 K at ambient to 35 K at 
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14.5 K, while the three-stage LNA decreased from 400 K to 
41 K at 12.5 K (Figs. 10 and 11). Figures 12 and 13 show the 
cryogenic noise temperature and gain response from 31 to 
33 GHz, along with bias settings for the two-stage and three- 
stage LNA, respectively. At 32 GHz, the two-stage LNA noise 
temperature measured 35 K, with an associated gain of 16.5 dB, 
at a physical temperature of 14 K, while the three-stage LNA 
yielded a value of 41 K with a 26.0-dB associated gain. It is 
also noted that the three-stage LNA displayed an almost 
flat noise temperature response across the measurement band, 
with a minimum noise temperature of 39 K at 32 GHz, while 
the two-stage LNA displayed a noise temperature response 
decreasing monotonically from 31 to 33 GHz, with a mini- 
mum noise temperature of 31 K at 33 GHz. 

It was further observed that both amplifiers did not show a 
persistent photoconductivity effect. That is, it was found that 
these devices can be cooled with or without illumination 
and/or dc bias, without any observable effect on the cryogenic 
low-noise performance. 


V. Conclusion 

Cryogenic coolable state-of-the-art 32-GHz HEMT LNAs 
have been demonstrated using 0.25-jum AlGaAs/GaAs HEMTs. 
The results clearly demonstrate their potential to meet the 
future need for extremely low-noise receivers for applications 
such as the DSN. Further advances in HEMT technology 
[12] promise to lead to improved performance at all frequen- 
cies and make possible the development of amplifiers operat- 
ing at frequencies up to 94 GHz. 

Currently, the DSN relies on maser amplifiers in order to 
provide the best possible telemetry support for deep space 
missions. These systems require a complex and expensive 
cryogenic system operating at 4.5 K. Since HEMT LNAs 
require less cooling power and operate at a higher physical 
temperature (12 K), they can be operated at less cost with a 
more reliable refrigeration system. The lower cost of HEMT 
LNAs will lead to greater frequency coverage and the eco- 
nomic realization of multiple-element cryogenic array feed 
systems. 
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Table 1. Performance comparison of two types of conventional AIGaAs/GaAs HEMTs 


Ambient 
Temp., K 

Freq., 

GHz 

Performance 

parameter 

Type A 
(10 18 /cm 3 ) 

Type B 

(2 X 10 18 /cm 3 ) 

300 


g m (mS/mm) 

380 

450 

300 

8 

Noise figure (dB) 

0.4 

- 

300 

8 

Associated gain (dB) 

15.2 

- 

300 

18 

Noise figure (dB) 

0.7 

0.7 

300 

18 

Assoc, gain (dB) 

11.5 

15.0 

300 

32 

Noise figure (dB) 

1.3 

1.2 

300 

32 

Assoc, gain (dB) 

7.5 

10.0 

13 

8.5 

Noise temp. (K) 

5.3 

13.1 

13 

8.5 

Assoc, gain (dB) 

13.9 

14.5 


75 
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Fig. 1. Cross section of the 0.25-/xm T-gate HEMT. 


Fig. 2. Ka-band two-stage HEMT LNA. 
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Fig. 3. Measured performance of Ka-band stepped ridge fixture. 
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Fig. 4. Equivalent circuit model of 0.25-/xm AIGaAs/GaAs HEMT. 
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Fig. 5. Schematic diagram of two-stage hybrid LNA using 0.25x75-/i.m HEMTs. 
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Fig. 6. Two-stage (a) and three-stage (b) HEMT LNA microstrip 
matching circuits. 


Fig. 8. Three-stage HEMT LNA room-temperature broadband 
response. 
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Fig. 9. 32-GHz cryogenic noise temperature measurement system. 
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Fig. 10. Two-stage HEMT LNA noise temperature and gain cooling curve. 
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Fig. 11. Three-stage HEMT LNA noise temperature and gain cooling curve. 
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Fig. 13. Three-stage HEMT LNA at 12.5-K noise temperature and gain. 
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This report describes modeling of cross-guide couplers based on the theory of equiva- 
lent electric and magnetic dipoles of an aperture. Additional correction factors due to 
nonzero wall thickness and large aperture are also included in this analysis. Comparisons 
of the measured and calculated results are presented for cross-guide couplers with circular 
or cross-shaped coupling apertures. A cross-guide coupler was designed as a component of 
the C-band feed to support the Phobos mission. 


I. Introduction 

In this article, the performance of a directional coupler is 
specified by two parameters: coupling and directivity [1] . The 
circular aperture and cross-shaped aperture are chosen because 
the circular aperture is the most well-studied aperture shape 
and the cross-shaped aperture is frequently used for cross-guide 
couplers. 

A program was developed to calculate the coupling and the 
directivity of circular or cross-shaped aperture couplers. Either 
one aperture (Fig. 1) or two symmetrically spaced apertures 
(Fig. 2) may be used for directional coupling. The angle between 
two waveguides is arbitrary for one-aperture couplers, but it is 
90 degrees for two-aperture couplers. The apertures may be 
moved along the diagonal line in the common broad wall of 
the waveguides. Also, the distance between two apertures 
located on the diagonal line varies. The coupling and the direc- 
tivity for directional couplers may be calculated over a band of 
frequencies. 

II. Theory 

A. Basic Formulas 

The formulas for cross-guide couplers with off-centered 
apertures are derived in the Appendix. The radiation ampli- 


tude in the secondary waveguide for the coupled port (B + ) and 
the isolated port (B~) for the coupler of Fig. 1 is given by 

B+(p,m,d,9) = B t (/?,</) + B 3 (m,cOcQs0 + G(m f d)sin9 

B~(j o,m,d,6) = B y (p,d) + B 4 (w,<f)cos0 

where 9 is the angle between two waveguides, d is the distance 
from the center of the aperture to the waveguide wall, and p 
and m are the electric polarizability and magnetic polariza- 
bility of the aperture. 

p = p 0 * FE • TANE 

m - m Q • FM • TANM 

Here, p 0 and m 0 are the electric polarizability and magnetic 
polarizability of a small aperture with zero wall thickness. 
They are constants that depend on the shape and the size of 
the aperture. For a circular aperture, p 0 r 3 , m Q = -|r 3 , 
where r is the radius of the aperture. Values for p 0 and m Q for 
a cross-shaped aperture are given in [2] . FE , FM , TANE , and 
TANM are defined below. 
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B. Wall Thickness Factor 

The finite wall thickness of an aperture has the effect of 
reducing the coupling. The effect of finite thickness is taken 
into account by treating the aperture as a finite length of wave- 
guide beyond cutoff. FE(t ) and FM(t) represent the electric 
attenuation and magnetic attenuation in the aperture of thick- 
ness t and are given by [3] 


The resonant frequency is approximately equal to the cutoff 
frequency of a waveguide having the same cross-sectional 
shape and size as the aperture. Therefore, / 01 may be replaced 
by/ cl and/ 02 may be replaced by/ c2 . 


c 



FE(t) = e 2n 


FM(t) = e 



1/2 

f ' AE 


1/2 

t • AM 


where A cl = the cutoff wavelength of TM 0l mode of aperture 
waveguide, X c2 = the cutoff wavelength of TE XI mode of aper- 
ture waveguide, \ cl = 2.6127/*, \ c2 = 3.4126/* for a circular 
waveguide, while \ cl and \ c2 of a cross-shaped waveguide are 
given in [4] and [5] . 

The additional factors AE and AM are effective wall thick- 
ness coefficients. For a circular aperture with radius r and 
thickness t [6] , [7] 


AE = 1.0103 + 0.0579 y 
AM = 1.0064 + 0.0819y 


t/r > 0.2 


AE = 1.1091 - 0.0082268 y 

f/r< 0.2 

AM = 1.4273 -0.0023284 j 

The AE and AM of a cross-shaped aperture are determined 
experimentally. 


f. 


c 2 


C 



III. Results 

A. One-Circular-Aperture Coupler 
with 9 = 45 Degrees 

An off-centered circular-aperture coupler with adjustable 6 
has been fabricated. The dimensions of the WR1 12 coupler are 
0.17-inch aperture radius, 0.128-inch thickness, and 0.283-inch 
distance from center of aperture to waveguide wall. For 6 - 45 
degrees, the calculated and the measured coupling and direc- 
tivity at frequencies from 7 to 9 GHz are shown in Figs. 3 and 
4, respectively. The calculated coupling is 0.5 to 0.7 dB higher 
and the directivity is 0.9 to 1.1 dB lower than measured. The 
results show good agreement between the coupler model and 
the experiment. 

B. Two-Circular-Aperture Cross-Guide Coupler 

A two-aperture cross-guide coupler using circular aper- 
tures was designed based on the preceding theory. The final 
coupler design had the following dimensions: WR125, 0.1 3-inch 
aperture radius, 0.05-inch thickness, and 0.3125-inch distance 
from center of aperture to waveguide wall. The measured 
coupling is 0.2 to 0.6 dB lower than calculated, while the 
directivity is 1 .0 to 1 .4 dB higher than expected at frequen- 
cies from 7 to 9 GHz (Figs. 5 and 6). In this case, and in 
general, the coupling is predicted more accurately than the 
directivity. The coupler computer program provides a pessi- 
mistic value of directivity. 


C. Large Aperture Factor 

An infinitely thin aperture actually has an unlimited num- 
ber of resonances. For a large aperture, the following frequency 
correction factors are needed [8] : 

TANE = -^ tan Jj- 

n 2 /.. 


TANM = 


V, 


02 
7 rf 


tan 


j£_ 


02 


C. Two-Cross-Shaped-Aperture Cross-Guide 
Coupler 

As an example of a design using cross-shaped apertures, a 
C-band coupler meeting the following requirements was 
designed. 

Coupling: -30 ± 1 dB 

Directivity: 20 dB minimum 
Waveguide: WR187 
Frequency: 4.96-5.0 6 GHz 
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The final design uses two symmetric cross-shaped apertures 
of 0.662-inch length, 0.115-inch width, 0.05-inch thickness, 
and located at a 0.468-inch distance from center of aperture to 
waveguide wall (Fig. 7). AE - 1 .36 and AM = 1 .53 were deter- 
mined from a previous WR125 30-dB coupler experiment. 
The coupler is expected to have approximately -30-dB cou- 
pling and 25.6-dB directivity according to the coupler com- 
puter program. 

The measured and computed results are shown in Figs. 8 
and 9. In this case, the experimental coupling is approximately 
0.4 dB lower and the directivity is 0.8 to 1.5 dB higher than 


calculated. Figures 8 and 9 show that the directional coupler 
meets the design requirements. 

IV. Conclusion 

A brief description of a cross-guide coupler model was 
presented. Three specific examples have demonstrated good 
agreement between experiment and theory. In most cases, a 
suitable coupler can be designed using the simple theory pre- 
sented in this report. Further directional coupler study is 
required to improve the prediction of directivity. Coupling 
between apertures and the nonuniform field over the aper- 
ture could be included in the model to obtain higher accuracy. 
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Fig. 5. Coupling of two-circular-aperture cross-guide coupler. 
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Fig. 8. Coupling of two-cross-shaped-aperture 
cross-guide coupler. 



Fig. 6. Directivity of two-circular-aperture cross-guide coupler. 
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Fig. 9. Directivity of two-cross-shaped-aperture 
cross-guide coupler. 


Fig. 7. A C-band two-cross-shaped-aperture cross-guide coupler. 







Appendix 

Equations for Offset Aperture with an Arbitrary Angle 9 


In this appendix, the results in [1] are extended to include 
an offset aperture with an arbitrary angle 9 between two 
waveguides. 

Referring to Figs. 1 and 2, assume the dimensions of the 
rectangular waveguides are a and b. The incident field in the 
primary waveguide is TE l0 with amplitude^. 


The field radiated by the electric dipole has an amplitude 


B x (pA6) 


^ g~ 
abY io 

cj 


/“ e 0 

abY 

CJ 


p A sin 2 


ud 

a 


I. One-Offset Aperture with Arbitrary 9 

The equivalent electric and magnetic dipoles (P,M) located 
at the center of the aperture (z = 0, x = d) for radiation into 
the secondary waveguide are 

P = e 0 pA siny a y (A-l) 

M = -mAY (-sin — a +/^-cos — a) (A-2) 
w\ a x 1 fi a a 2 1 v J 

where p and m are the electric and magnetic polarizability, 
is the wave admittance for TE 10 mode, and j3 is the propaga- 
tion constant. 

If the secondary waveguide with a coordinate system de- 
fined by unit vectors 2^, d' y , a z is rotated by an angle 9 with 
respect to the primary waveguide with a coordinate system de- 
fined by unit vectors $ x ,a y , a z (see Fig. 1) 


= B x (p,d) 


in the coupled port and 


w 


= B x (p,d) 


in the isolated port. 

The field radiated by the magnetic dipole has amplitude 
B 3 M8) =-£§-B~ io . M' 

CJ 

sin 2 — +— - — cos 2 — )cos© 

V a 0 V a ) 


ab 


mA 


a x = cos© a' x + sin© d’ z (A-3) 

? = 3 ; (A4) 

tt z = -sin© a' x + cos© a' z (A-5) 

Substituting Eqs. (A-3), (A4), and (A-5) into Eq. (A-l) and 
(A-2) 

P t 1 . TTCf /\/ 

= e 0 pA sin— 

M' = -mAY [-(sin — cos© +/-£- cos — sin©)^' 
w l_ V a ' 0 a a I * 

+ (-sin — sin© cos— cos©) 3(1 
\ a (3a a I Z J 


, _ . u . nd nd . . 

+ 2 1 rr- sin — cos — sin© 
(3a a a 

= B 3 (m,d ) cos© + G(m,d) sin© 


in the coupled port and 


ab 


mA 



7 id 7T 2 
* 0 2 * 2 


COS' 


^jcos0 


= B 4 (m,d) cos 9 
in the isolated port. 
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Therefore, the field in the secondary waveguide has an 
amplitude 

B + (p y m,d,Q) = B x (p,d) + B 3 (m,d) cos0 + G(m,d) sin0 
in the coupled port and 

B~(p,m,d,d) = B x (p,d) + B 4 (m,d) cosO 
in the isolated port. 

The coupling (C) and directivity (D) are given 



Some special cases are 

(A) d = -|, 6 * 90° 

C K)- 0 

-« 4 (™. y ) 

= fi i(P’f) + fi 3( m 'y)cos 0 
cosd 


(B) d=^,6 = 90° 

i? + (p,//iy,90j = B _ (p,m,y90°) = £,(/?,§) 

D = OdB 

(C) d*£,0 = o° 

B + (p,m,d,Q°) = B x (p,d) + B 3 (m,d) 

B~(p,m,d, 0°) = 

(D) </*y,0 = 9O° 

5 + (p,/M,90°) = 5 1 (^,cO + G(ot, 0 
£~0?,m,</,90°) = (/>,</) 

II. Two Symmetrically Spaced Apertures 
with 9 = 90 Degrees 

For two symmetrically spaced apertures which are located 
at d 1 = d and d 2 = a - d, let A = a - 2d, 0 = 90° (Fig. 2). The 
field radiated in the secondary waveguide is 

BB + (p,m,d, 90°) = B + (p,m,d, 90°) + B + (p,m,a - d,9O°)e- 2J0A 
in the coupled port and 

BB~(p,m,d, 90°) = [B~(p,m,d, 90°) + B~{p,m,a - d,90°)] e~ } & A 
in the isolated port. 
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This report summarizes recent developments in the analysis of various waveguide com- 
ponents and feedhorns using Modal Analysis (Mode Matching Method). A brief descrip- 
tion of the theory is presented , and the important features of the method are pointed out. 
Specific examples in circular , rectangular, and coaxial waveguides are included, with com- 
parisons between the theory and experimental measurements. Extensions to the methods 
are described. 


I. Introduction 

Modal Analysis has been shown to be a highly accurate and 
versatile method for analyzing a wide variety of waveguide 
devices [l]-[4]. The method is capable of accounting for 
multiple reflections within the device, stored energy at each 
discontinuity, and higher-order mode propagation if it occurs. 
Its high accuracy makes it useful for tolerance studies after a 
final design has been determined. This report compares com- 
puted and experimental results for the scattering parameters 
of three examples. One example is taken from each of the 
following waveguide types: rectangular waveguide, circular 
waveguide, and coaxial waveguide propagating the TE n mode. 

II. Theory 

The theory described below is well known and is summa- 
rized in [1] . In applying the modal analysis method, the wave- 
guide device is broken up into a series of sections that are 
joined by a step discontinuity as is shown in Fig. 1 . For smooth 
changes in waveguide dimensions, the change is approximated 


by a large number of steps. At this point, the type of wave- 
guide is arbitrary but the common area between the two 
guides must be identical to the cross-section of the smaller 
waveguide. This eliminates a class of offset connections but 
is usually not important for analyzing a practical device. In 
addition, for the circular waveguide and the coax, all guides 
are required to possess the same center line. This simplifies 
the analysis since only modes with one azimuthal variation 
need to be considered. Again, this is not restrictive for most 
practical applications. Next, the development of some of the 
important equations is presented. In Fig. 1, the fields to the 
left of the junction (z < 0) are represented as a sum of the 
normal modes of waveguide I. 

M 

B, - £ 0) 

m = l 
M 

a, - £ <* tm ( 2 ) 

m = 1 
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Here M is chosen large enough for convergence, and e Im 
and h Im are the normalized vector functions for the mth mode. 
For example, in a circular waveguide, m = 1 = TE n , m = 2 
= TM n , m = 3 = TE 12 , etc. A Im represents the magnitude of 
the forward traveling mth mode, and B Im the magnitude of 
the reverse traveling mth mode. 


i 


The normalization of e Im and h Im is such that 

(?/m X — /m^ * ds = R mm 
and from the orthogonality of the waveguide mod 
(i lm X h In ) ‘ds =0 m*n 


(3) 
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(4) 


Similarly, in region II 
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-II * ^Iln e 
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( 5 ) 


( 6 ) 


where N is the number of modes chosen in region II. Rela- 
tions analogous to Eqs. (3) and (4) hold for this region also. 

Matching the electric and magnetic fields over the common 
aperture results in the following scattering matrix equation. 


B = [S] A 


B = 


B x 
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ii 


( 7 ) 


( 8 ) 


A = 


l ii 


and 


[S] = 


Pni W 
1^2 J ^22^ 


( 10 ) 


In Eqs. (8) and (9), B Y and B n are vectors containing the 
reflected-mode amplitudes, while ^4j and A u contain the 
incident -mode amplitudes. The derivation of these equations 
is given in the Appendix. 

For the normalized vectors, the power carried by the mth 
forward traveling mode is given by \A Im | 2 , and for the mth 
reverse traveling mode is I B lm | 2 . 

Next, the matrices for a straight section of length L are 
needed. The solution is trivial, giving 


[S n ] - [0] 

(ID 

[S 12 ] - 

(12) 

P ai l = M 

(13) 

[S 22 ] = [0] 

(14) 

where [0] is the zero matrix and [y l2 ] and [ 7 2] ] 
matrices with elements 

are diagonal 

y 2l (n,n) = e y " L = y 12 (n,n) 

(15) 


y n being the propagation constant for the nth mode and L 
being the section length. 

This completes the summary of the required equations for 
each step in the analysis. Using these results, matrices for each 
step and each straight section in the device are determined and 
then combined using equations in [1] . At the completion of 
the analysis, the overall matrix is obtained, relating the nor- 
malized output vectors B l and B n at the ends of the device to 
the normalized input vectors A x and^4 n . 


fi, ■ 


(16) 

( 17 ) 


In many situations, a set of modes is incident only on the 
(9) left end of a device and one wants to determine the reflected 
and transmitted modes. The user specifies the input mode 
vector ,4j, A u = 0, and Eqs. (16) and (17) become 


B , = 

*n= 


(18) 

(19) 
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III. Results 

Computer programs [5] have been written to carry out the 
above calculations for rectangular, circular, and coaxial wave- 
guides. For the junctions involved, the integrals of Eq. (A-6) 
can be carried out in closed form, which greatly simplifies the 
programming. Three examples, one in each waveguide type, 
are used to demonstrate the excellent agreement between theory 
and experiment. All of the measurements were made using an 
HP 8510 network analyzer and a full two-port calibration. 

The first example, shown in Fig. 2, is a circular waveguide 
transition. For the theoretical results, 25 modes were used in 
the input section. The number of modes used in subsequent 
sections is chosen by the program for optimum convergence 
[1 ] . The return loss measurements (Fig. 3a) are typically within 
0.2 dB, except near the minimum reflection point at 8.25 GHz. 
Phase results (Fig. 3b) are also in close agreement, typically 
within a few degrees across the band. For nearly every obser- 
vation point, the difference between theory and measurement 
is within the accuracy specification of the network analyzer. 
Slight inaccuracies in the waveguide dimensions and rounding 
of some of the corners can also account for the small disagree- 
ment that remains. Figure 4 illustrates the convergence of the 
solution at 8 GHz as the number of modes used in the input 
section is increased. From these plots, we see that the solution 
has stabilized once 20 or more modes are used in the input 
waveguide. The number of modes required for convergence 
depends on the particular device, but in general larger wave- 
guides with respect to a wavelength and thin irises require that 
more modes be used in order to get the same accuracy. 

A rectangular waveguide example is shown in Fig. 5 and 
both theoretical and experimental results are given in Fig. 6. 
The device is a WR125 to 0.8-inch square-to-WR125 transition 
that was fabricated for use in the ring resonator at 8.51 GHz. 
The device consists of nine waveguide sections. The figure 
shows that, as in the previous example, the theoretical and 
experimental results are in excellent agreement. Only slight 
discrepancies appear near the minimum reflection point. More 
modes may be needed to represent the field in this region, or 
else the 0.030-inch radius on all corners, which was not 


accounted for in the calculation, may have a stronger effect 
in this frequency band. For this example, modes with first 
index m less than or equal to 7 and second index n less than 
or equal to 6 were used in the input guide. As with the circular 
waveguide program, maximum mode indices in the following 
sections are chosen according to waveguide size and symmetry 
considerations. 

The final example is the coaxial iris shown in Fig. 7, with 
theoretical and experimental return loss results shown in 
Fig. 8. The coaxial region is excited by a TE n circular wave- 
guide mode that excites only the higher-order coax modes 
with first index equal to 1 ; the normal TEM coax mode is not 
excited in this case. Measurements of the iris were made by 
calibrating in the circular waveguide and using the time domain 
gating features of the HP 8510 network analyzer to isolate 
the reflections from only the iris. The only other complica- 
tion associated with the coax is that a transcendental equa- 
tion must be solved for each mode in each section in order 
to determine a cutoff wavelength. This increases the computa- 
tion time required to solve a coax problem compared to a 
similar circular or rectangular waveguide problem. As with 
the previous examples, the agreement between theory and 
experiment is good, particularly considering the errors intro- 
duced by using the time domain features of the HP 8510. 

IV. Conclusion 

Three representative examples have been given to demon- 
strate the accuracy of the modal analysis method. A large 
number of waveguide devices such as horns, corrugated wave- 
guides, transitions, filters, and smooth tapers can be analyzed 
using these programs. In addition, several extensions have 
been made to the codes in order to allow for differing dielec- 
tric constants in the sections, making them useful for window 
design. For large smooth-wall or corrugated horns, the reflec- 
tion at the aperture may be neglected, and the far-field pattern 
can be found from the propagating modes in the aperture. In 
addition, the important case of ring-loaded slots [6] , which is 
a combination of the coaxial and circular program, has also 
been programmed, but no experimental results are presently 
available. 
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Fig. 4. Convergence for circular waveguide example: (a) return 
loss and (b) phase. 



Fig. 5. Rectangular waveguide example. 



Fig. 6. Rectangular waveguide return loss results. 
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Appendix 

Derivation of the Waveguide Scattering Matrix Equation 


To derive Eq. (10), the electric fields inside the common 
aperture between the two regions are matched. 

Ej = E u inside Sj (A-l) 

Ej X h Un = E u X h Un inside S 7 (A-2) 



ffi.x 



®„X *„.)•* 


Un* 


(A-3) 


Since E = 0 on the conductor making up the surface S u - S v 
the integral on the right-hand side of Eq. (A-3) may be extended 
over S jj. 



(Mi x h Un ) • ds 





(A-4) 


giving 
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^ Im ^ ^ ^mn^Un 
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(A- 10) 
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Equations (A-5) and (A-10) may be recast into a more com- 
pact matrix form, giving 


[P] = [Q] (A n +B n ) (A-l 1) 

[R] 04,-i?,) = [/>] T (£„-/!„) (A- 12) 


Here [P] T is the transpose of the matrix [P] , and [R] is an 
mXm diagonal matrix, and [Q] is an n X n diagonal matrix. 


Using the properties in Eqs. (l)-(6) the following is obtained: 
M 

V (A T +B r )P = G4 tt + P tt )Q (A-5) 

/ , v Im Im' m n v Iln lln'^nn v 
m- 1 

where 


Next, Eq. (A-12) is converted into a scattering matrix for- 
mat relating the normalized output vectors B Y and B n to the 
normalized input vectors A x andvt n . 

The submatrices [S n ] , [S 12 ] , [S 2l ] , and [S 22 ] are derived 
from the [P] , [P] T [P], and [ Q ] matrices by simple matrix 
math and Eqs. (A-l 1) and (A-12). 


and 


P 

mn 


Q„ 



{e. X h..) • ds 


(e„ X h,, ) • ds 
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(A-6) is n ] = a*] + fi t Mr 1 m - [/ > i T t^D 

(A-l 3) 

[S i2 ] =2K/R] ([R] + [P] T [R]) [R] T [VS ]" 1 (A- 14) 

(A- 7 ) 

P 21 ] =2 [VS] ([Q] + [P] [P] r ) [P] [VR ]" 1 (A- 15) 


The other boundary condition needed is 
- H u within S 2 


(A-8) 


[S 22 ] = IQ] (IQ] + [P] [P ] 7 )' 1 ([Cl - [R] [R] t ) [VS ]- 1 

(A- 16) 


Following a similar line of reasoning 



(A-9) 


In these equations, [/] represents the unit matrix and 

[VR] [VR] = [R], and [VS] [VS] = [VS] are from 
Eqs. (3) and (7). These factors form the normalization of the 
vectors A and B. This completes the solution for the junction 
between the two different waveguides. 
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A proposed conceptual design to increase the output power of an existing X-band 
radar transmitter used for planetary radar exploration from 365 kW to 1 MW CW is 
presented . The paper covers the basic transmitter system requirements as dictated by the 
specifications for the radar. The characteristics and expected performance of the high - 
power klystron are considered ' and the transmitter power amplifier system is described. 
Also included in the discussion is the design of all of the associated high-power micro- 
wave components , the feed system , and the phase-stable exciter. The expected perfor- 
mance of the beam supply, heat exchanger, and monitor and control devices is also 
presented. Finally , an assessment of the state-of-the-art technology needed to meet 
system requirements is given and possible areas of difficulty are summarized. 


I. Introduction 

Radar has been used as a remote tool for exploration of our 
solar system since 1946, when detection of echoes from the 
Moon was reported. Since then, ground-based radar studies 
have been made of the planets Mercury, Venus, and Mars, the 
Galilean satellites, the rings of Saturn, and nearly a dozen 
asteroids. The sensitivity of the radar instruments used in 
these experiments has increased by a factor of approximately 
10 12 since the first lunar detection. Such great gains in sensi- 
tivity have been achieved by extraordinary improvements in 
antenna size, low-noise receiving systems, high-speed digital 
signal processing, and higher-power transmitters at higher 
frequencies. 


One area that needs further development in order to in- 
crease the remote sensing capability of radar is the increase of 
power in the transmitter. Therefore, at JPL, design and devel- 
opment has been initiated to extend the present X-band trans- 
mitter capability from 365 kW CW to 1 MW CW. 

This experimental transmitter will be installed on the 70- 
meter dual-reflector antenna at Goldstone, California, which is 
equipped with a rotatable, asymmetric subreflector that per- 
mits the use of multiple feed systems at the antenna focus. 
The subreflector can be precision indexed to a fixed number 
of positions that will allow each feed to properly illuminate 
the main reflector. Assuming an aperture efficiency of about 
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75% and corresponding antenna gain of 74.5 dB at X-band, 
with the proposed 1-MW CW transmitter, the X-band radar 
system will have an effective radiated power of about 28 TW 
(2.8 X 10 13 W). 

Using an assessment of the state-of-the-art technology, this 
article describes the X-band transmitter, including the trans- 
mission system and a feed system. 

II. Transmitter System Requirements 

The transmitters used for radar astronomy systems differ 
from conventional radar systems in that they require high 
average power, rather than high peak power, over the band- 
width required to handle the transmitted signal [1]. It is 
also important that these transmitters be coherent in order 
to determine the phase relationships of the returned signals, 
and they must have high phase stability if coherent measure- 
ments are to be made over long periods of time. The transmit- 
ter must also be capable of modulation by a variety of pulse 
programs, while maintaining the phase and amplitude fidelity 
and pulse-to-pulse stability required for pulse-compression sys- 
tems incorporated in the radar. 

The above requirements illustrate that high power alone 
will not provide the desired CW radar transmitter capabilities. 
If this were the case, it might be more easily obtained with an 
oscillator rather than an amplifier. Besides the advantage of 
having dynamic control of amplitude and phase, the appeal of 
using an amplifier is that it eliminates the need for phase- 
locking an oscillator to a control signal. 

Based on the above requirements, the X-band radar trans- 
mitter specifications are given in Table 1. 

III. The Transmitter System 

As shown in Fig. 1, the transmitter will include a power 
supply that converts 2400-V, 3-phase, 60-Hz line voltage to 
direct current at up to 50 kV with a power limitation of 
2.25 MW for the four klystron amplifier beams. The frequency 
synthesizer-based exciter and the buffer amplifier will provide 
an input signal to these klystrons, and each of the four kly- 
strons will provide approximately a 53-dB power gain. The 
automated transmitter control will perform monitoring and 
control of all functions. Protective devices (interlocks) will 
prevent damage to equipment by removing voltage and in 
some cases drive power in the event of a malfunction. The 
liquid-to-air 2.5-MW heat exchanger will be used to cool the 
amplifier, the power supply, various auxiliaries to the trans- 
mitter, microwave components of the transmitter, and micro- 
wave components of the transmission line. The following para- 


graphs describe each of the above components in greater 
detail. 

A. Exciter 

Figure 2 shows the proposed configuration of the exciter 
portion of the 1-MW radar, which is based on an HP 8662A 
synthesizer. The synthesizer uses the 10-MHz reference signal 
from a hydrogen maser to produce a phase-coherent output at 
640 MHz, and a phase-coherent variable-frequency signal from 
10 kHz to 640 MHz with 0.1-Hz resolution, or from 640 to 
1280 MHz with 0.2-Hz resolution. The 640-MHz signal can be 
picked off and mixed with the variable-frequency output. For 
the transmitter, the variable-frequency output is set to 830 
MHz and mixed with 12 times the 640 MHz to produce 8510 
MHz. In the receive configuration, the variable frequency is 
set to 505 MHz and mixed with 12 times the 640 MHz to 
produce a signal at 8185 MHz, which could be used as the first 
local oscillator in the receiver. A similar system, using 3 times 
640 MHz, is used for the S-band exciter and could also be used 
as a first local oscillator for an S-band receiver. 

Reduction of phase noise is a major concern in the exciter 
design. By mixing the output of an extremely low-noise, high- 
frequency synthesizer with low multiples of a clean, fixed- 
frequency oscillator, total phase noise should be reduced by 
more than 20 dB from that produced by the more standard 
method of using a high multiple of a low-frequency synthe- 
sizer. 

Provisions are made for biphase noise modulation, fre- 
quency hopping, and continuous frequency sweep for Doppler 
cancellation. The frequency hopping is accomplished by send- 
ing frequency step commands to the synthesizer on an IEEE- 
488 interface. Doppler cancellation is done through a combin- 
ation of a coarse frequency, set through the IEEE-488 inter- 
face, and an analog voltage at the search oscillator input. Phase 
noise modulation is done directly with a biphase modulator at 
the output frequency. 

A power divider is included in the exciter, providing sepa- 
rate outputs for the phase reference system and the klystron 
drive system in the buffer amplifier. 

B. Buffer Amplifier 

The buffer amplifier is the second major block of Fig. 2. 
For the first choice of transmission line arrangements (see Sec- 
tion D), phase-tracking loops and electronic polarization con- 
trol are provided in the exciter. The phase-tracking loop uses a 
voltage-tracking controlled phase shifter in the drive path of 
each klystron to compensate for phase changes in klystrons or 
microwave components. A sample of the output from each 
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klystron is taken as close to the feedhorn as possible and com- 
pared with the reference signal. Symmetry in the waveguide 
paths from the final splitter is still required to prevent differ- 
ential phase shift between the two horn inputs driven by the 
same klystron. 

Because the feed system uses two klystrons to drive the in- 
line inputs to the horn and the other two klystrons to drive 
the orthomode inputs, polarization control is achieved by the 
phase shifter after the first splitter in the buffer amplifier. 
Phase shifts of ±90 degrees yield right-hand or left-hand circu- 
lar polarization, and phase shifts of 0 or 180 degrees yield 
orthogonal linear polarizations. Other phase shift values pro- 
duce various elliptical polarizations. A corresponding phase 
shift must be introduced in the phase correction loop, but in 
the case of switching from right-hand circular to left-hand cir- 
cular polarization, this can be done with an inverting amplifier 
after the phase detector. For the waveguide-based system, no 
special electronic control is required in the exciter. In both 
cases, a solid-state amplifier for each klystron provides the 
approximately 2-watt drive power required. 

C. Power Amplifier 

The power amplifier section of the transmitter contains 
four 250-kW CW klystrons. The requirements of high power, 
high gain, good efficiency, ease of modulation, and an output 
spectrum free from spurious signals and noise make a klystron 
linear-beam tube the natural choice for radars, as long as its 
narrow bandwidth, high operating voltage, and large size can 
be tolerated. 

Early in 1986, Varian Associates was contracted by JPL to 
do a paper design for eventual development and production of 
extremely high-power X-band klystrons. The characteristics of 
this tube, designated VKX-7864A, are given in Table 2. 

As part of the design for this klystron, Varian is expected 
to provide phase modulation sensitivity due to various pushing 
factors such as beam voltage, drive power, body coolant, focus 
current, and heater voltage. These phase-pushing factors are 
important because they produce unwanted discrete lines on 
the phase noise spectrum. 

Based on overall transmitter system specifications, these 
modulation sensitivities, together with modulation sensitivity 
of other pushing factors, establish the requirements for power 
supply stability, body coolant temperature stability and sta- 
bilities of other pushing (control) factors. 

Each klystron is provided with an arc detector at the win- 
dow and a reverse power coupler for protection. In the event 


of a fault, these monitors will remove the RF drive before per- 
manent damage can occur. 

One of the critical elements of the klystron is the guiding 
magnet. This device is a solenoid which surrounds the inter- 
action volume and keeps the electron beam focused in the 
tube length before the collector. A control of better than 1% 
must be exercised to maintain high efficiency and low body 
current. This solenoid will weigh about 300 pounds and will 
require a 300-V, 20-A dc power supply to provide the proper 
magnetic field. 

The separate coolant manifold for each klystron will moni- 
tor and control flows, temperature, and pressure. This data 
will be routed to the data collection system, which is described 
in Section H. 

The power amplifier, including the transmission line (de- 
scribed in the next section) will all be housed in the cone. The 
mechanical layout is shown in Fig. 3. 

D. Transmission Line 

Two alternatives for the transmission line system are under 
consideration. The first system is shown in Fig. 4. For this sys- 
tem, the output from each klystron passes through a wave- 
guide switch and a directional coupler before being split into 
two 125-kW signals. Four of these signals (two pairs) feed the 
in-line ports of four orthomode junctions, while the other pair 
feed the ortho ports. Thus, by adjusting the relative phase be- 
tween the two pairs of klystrons, one of two orthogonal linear 
polarizations, RHCP or LHCP, may be obtained (see buffer 
amplifier, Section C). The outputs of the four orthomode 
junctions then feed the four inputs to the multimode feed- 
horn. In this system, phase detectors and electrically con- 
trolled phase shifters will be used to adjust the outputs of each 
of the klystrons and to provide polarization control. The re- 
liance on electronics reduces the complexity of the waveguide 
layout in comparison to the waveguide-based system described 
below. 

Figure 5 shows the waveguide-based alternative to the pre- 
vious system. This system is similar to that used at the Hay- 
stack Hill Observatory in Westford, Mass. [2] . In this system, 
a series of splitters and combiners ultimately forms four iden- 
tical signals. Each of the signals is composed of 25% of the 
power from each klystron, and all four are equal in amplitude 
and phase, independent of the four klystron outputs. Phase 
shifters are used in each of the drive lines to the klystrons in 
order to minimize the power in the waster loads. This adjust- 
ment is made once, and if klystron parameters or frequencies 
change during a track, only the waster-load power will change. 
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Polarization is changed mechanically through switches imme- 
diately before the orthomode junctions. Although this ap- 
proach is complicated mechanically, it has the advantage that 
beam position and polarization purity are virtually guaranteed 
without the use of any electronics. 

The final decision on which of the two possible systems, or 
a combination thereof, will be used depends on how closely 
the four klystron tubes can be matched in terms of phase, 
gain, and group delay versus frequency. Measurements on the 
two existing tubes as well as discussions with the tube manu- 
facturer (Varian Assoc.) are underway to help answer these 
questions. The effects of aging must also be considered to 
guarantee that the system will run reliably over the expected 
lifetime of the tubes with minimal adjustment. 

In both systems, WR125 is used as the high-power wave- 
guide to avoid operation close to the higher-order modes in 
WR137, which begin propagating at 8600 MHz [3] . 

The electrically operated waveguide switches allow selec- 
tion of the radar antenna or water loads for termination of 
transmitter output power. The water loads will also be used 
for calibrating the output power calorimetrically. 

On receive, the klystrons are turned off and the switches 
are rotated, connecting the receive waveguide to the feed- 
horn. Through a series of combiners, RHCP and LHCP signals 
are formed. These signals then enter the dual-channel maser, 
and finally the heterodyne receiver. 

In addition, the existing low-noise system, which uses a 
separate corrugated horn for receiving, will be retained. The 
disadvantage of this receiving arrangement is that the antenna 
subreflector must be rotated between transmit and receive 
cycles. 

E. Antenna Feed System 

The final component required in the transmission line for 
the radar system is the feedhorn, which will launch the trans- 
mitter power toward the subreflector of the antenna. The horn 
should illuminate the subreflector efficiently and increase the 
overall noise temperature of the system as slightly as possible. 
The horn will be designed to meet or exceed the RF perfor- 
mance of the feedhorns presently in use, with the added 
feature of 1-MW capability. 

Several possible feed types were considered for the 1-MW 
system. Experience indicates that conventional corrugated or 
dual-mode horns are not capable of handling the large CW 
power without breaking down at the small-diameter input sec- 
tion. It was also found that a rather large number of small 


horns arranged in a closely packed array would be required to 
illuminate the subreflector efficiently. Due to the complexity 
of this type of system, as well as the losses associated with the 
power-splitting components, the array concept was also 
rejected for the 1-MW system. 

The chosen design uses a multiflare rectangular horn [4] . 
Such a horn is well suited to the 1-MW system, since it pos- 
sesses an excellent radiation pattern and has been used in other 
power applications. 

The multimode horn is depicted in Fig. 6. Four square 
waveguides feed a large square chamber where the power is 
launched into a square multiflare horn. Since the large chamber 
is oversized for the frequency of interest, the sum of the 
power in the four waveguides can be supported without break- 
down. In the present case, each of the guides must support 
250 kW, and the large chamber 1 MW. Flare angle changes are 
used to generate the required higher-order modes for pattern 
symmetry. 

The analysis of the horn is carried out using mode-matching 
methods. The overall scattering matrix of the horn is obtained 
and, using these results, the input match, as well as the far- 
field radiation pattern, can be predicted for arbitrary input 
levels and phases in the four input guides. Calculated radiation 
patterns for the horn at 8.51 GHz are shown in Fig. 7. A de- 
tailed description of the horn will be provided in a future 
report. 

Estimates for the peak electric fields in the horn indicate 
that the maximum level (about 6.9 kV/cm) occurs in the four 
feeding waveguides, which are 0.95 in. square. This should be 
compared to the present 400-kW WR125 waveguide system 
(about 8.5 kV/cm) and the theoretical limit for a 0.95-in.- 
square waveguide, which is about 2.1 MW. Resonant ring tests 
[5] will be used to evaluate the power performance of the 
orthomode junctions and the horn. Should arcing become a 
problem, backup approaches include evacuating areas of the 
horn or pressurizing them with a dielectric gas such as SF 6 . 

F. Beam Power Supply 

A block diagram of the beam supply is shown in Fig. 8. 
Power at 12,600 V, 3 phase, and 60 Hz is supplied to sepa- 
rate substations from the commercial line (Edison), which is 
underground for the last mile. The 2400-V substation supplies 
the main motor generator only. Under critical operation, this 
2400-V, 60-Hz power is supplied separately by the diesel gen- 
erator. All auxiliaries are supplied from a 480-V substation. 
The output of the main motor-generator at 400 Hz is stepped 
up in voltage, transformed, rectified, and delivered to the 
klystron load through a filter, crowbar, and series limiter 
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resistor to voltages of 50 kV and 2.25 MW. The design of this 
power supply will maintain output ripple under full load of 
less than 0.05%, regulation of 0.01%, and settling time of 
200 milliseconds. 

The use of a frequency converter (such as a motor-generator) 
might seem unnecessary, but it actually provides worthwhile 
technical and economic advantages. It isolates the power line 
from a crowbar of the dc supply and greatly simplifies line 
protection problems. It also isolates the supply from short- 
duration line voltage fluctuations and transients due to the 
large inertia of the rotating components. The change from 
60 to 400 Hz reduces all transformer and filter sizes and 
costs. 

This beam supply is required to provide 50 kV between the 
klystron collector and cathode at a beam current of 11 amps 
(per klystron) during the long radar pulses (up to 10 hours). 

The ability of the beam supply to remain ripple free and 
tightly regulated, and to settle in 200 msec will require a 
unique and state-of-the-art feedback control circuit. The 
design is in progress and the first approach is being tested. 
The supply must also be capable of withstanding the stress 
imposed on it when an arc occurs in any klystron. 

G. Cooling System 

The cooling system provides a 2.5-MW cooling capacity 
for klystrons, focusing magnets, high-power microwave com- 
ponents, water loads, feed, and the transformer rectifier, 
including the motor-generator clutch. Basically, the cooling 
system is a closed loop which consists of a heat exchanger, a 
distribution manifold, and all connecting piping. Coolant is 
circulated through the cooling system by the heat exchanger 
pumps. The coolant gains heat as it passes through the RF 
system (buffer amplifier, power amplifier, and microwave 
components) and loses heat as it passes through the heat 
exchanger. A purity loop is connected to the input of the 
heat exchanger to maintain the purity of the coolant. 

As part of this transmitter modification, the pumps will 
have to be upgraded, including replacing all of the 6-inch 
water lines, and a complex water-switching mechanism will 
have to be installed. The water-to-water heat exchanger will 
have to be changed to a liquid-to-air heat exchanger. 

H. Monitor and Control 

Operation of the 1-MW radar will be extensively automated. 
The control system will be composed of an HP Industrial 


Vectra computer, two HP 38526 Data Acquisition and Control 
units, a frequency counter, and a multichannel power meter. 
All the instruments will be connected by an IEEE-488 inter- 
face bus, with a fiber optic extension between the control 
room and the cone area of the antenna. The IEEE bus will also 
connect to the synthesizer in the exciter. 

The monitor and control software will be written in Ada, 
a language especially designed for hardware control applica- 
tions, and will make use of artificial intelligence principles to 
maximize the system functionality while maintaining a simple 
user interface. It will automatically keep long-term data logs 
to look for trends and predict failures before they can disable 
the radar. It will correlate data from different sources to dis- 
tinguish between sensor problems and transmitter problems, 
and it will be able to calibrate the RF power meters by preci- 
sion measurement of the flow rates and temperature rises of 
the coolant in the water loads. 


IV. Concerns and Conclusion 

The previous sections make it clear that this radar transmit- 
ter will be a very complicated system, and therefore there are 
several areas of concern. Of primary concern is the possibility 
of waveguide/feedhorn breakdown. Associated with this prob- 
lem is the difficulty of testing the components at or above the 
normal operating power level before all of the klystrons arrive. 
Resonant ring testing can be used for some components, but 
the most critical component, the feedhorn, can be tested 
under full power only when all four klystrons are available. 

Several additional areas of concern involve the klystron 
tubes. Both systems under consideration, particularly the 
transmission line system (Option 1), demand that the four 
amplifiers have matched gain and phase characteristics over 
the band of interest. This tube-to-tube matching could become 
difficult, particularly given the stretched procurement sched- 
ule for these tubes. Also, in order for the radar to operate at 
full output power, a VSWR of less than 1 .05 to 1 must be pre- 
sented to each tube. This is a difficult requirement. The feed- 
horn window, which must be capable of handling 1 MW CW, 
is also a concern. If the present material (kapton) is not suit- 
able, an alternative must be found. 

Finally, this radar transmitter would be the most compli- 
cated transmitter in the field, with the most densely populated 
feedcone on the 70-meter antenna, and would require special 
knowledge and care from the maintenance personnel at the 
site. 
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Table 1. 1-MW X-band radar transmitter specifications 


Parameter 

Specification 

Frequency 

8.51 GHz 

Bandwidth 

20 MHz (-1 dB) 

6 MHz (normal usable range) 

RF output power 

1 MW CW (+90 dBm) 

RF stability 

±0.25 dB over one planetary transmit/ 
receive cycle 

Incidental AM 

60 dB below carrier at all modulating 
frequencies above 1 Hz 

Phase stability 

10' 15 (1000 sec) 

Incidental PM (jitter) 

<1° peak to peak 

Transmit period 

30 sec min. to 10 hr max. 

Modulation: 

Phase Modulation 

Biphase, 40-dB carrier suppression, 
dc to 20 MHz 

Frequency 

±2 MHz every few seconds 

Hopping 

Frequency 

±2 MHz in 200 msec 

Ramping 

Group Delay Dispersion 

10 nsec over 6-MHz bandwidth 

Polarization 

RHCP or LHCP (one polarization at a 
time; cross polarization <-25 dB) 


Table 2. Characteristics of VKX-7864A X-band klystron 

Parameter 

Specification 

Frequency 

8510 MHz 

Bandwidth 

20 MHz (1-dB points) 

Output power 

250 kW min. 

Beam voltage 

50 kV 

Beam current 

11 A 

Efficiency 

50% 

Gain (sat) 

53 dB 

Filament voltage 

15 V 

Filament current 

20 A 

Magnet voltage 

300 V 

Magnet current 

20 A 

Klystron weight 

300 lb 

Klystron height 

5 ft 


BUFFER AMP 
AND PHASE 
CORRECTION 


POWER AMP 
(4) KLYSTRONS 
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CROWBAR AND 
BEAM CONTROL 


DRIVE 

INHIBIT 


CENTRAL TRANSMITTER 


MONITOR AND CONTROL 
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MICROWAVE 
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EXCHANGER 


MOTOR-GEN 
{400 Hz) 


DIESEL OR 
EDISON 
2400 V, 60 Hz 

BEAM SUPPLY 


RADAR CENTRAL CONTROL 
HIGH SPEED DATA 
ACQUISITION SYSTEM 


Fig. 1. 1'MW X-band radar transmitter block diagram. 
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Fig. 4. Transmission line system (Option 1). 
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Current plans for supporting Voyager's encounter at Neptune include the arraying of 
the DSN antennas at Goldstone, California , with the National Radio Astronomy Observa- 
tory y s Very Large Array ( VLA ) in New Mexico. Not designed as a communications antenna , 
the VLA ’s signal transmission facility suffers a disadvantage in that the received signal is 
subjected to a “gap” or blackout period of approximately 1.6 msec once every 5/96 sec 
control cycle. 

Previous analyses showed that the VLA data gaps could cause disastrous performance 
degradation in a VLA stand-alone system and modest degradation when the VLA is 
arrayed equally with Goldstone. These basic conclusions were independent of whether 
Voyager was using its convolutional code alone or the convolutional code concatenated 
with its Reed-Solomon outer code . 

New analysis indicates that the earlier predictions for concatenated code performance 
were overly pessimistic for most combinations of system parameters , including those of 
Voyager-VLA. The periodicity of the VLA gap cycle tends to guarantee that all Reed- 
Solomon codewords will receive an average share of erroneous symbols from the gaps. 
The number of gapped symbols is not subject to the same kind of statistical fluctuations 
that govern the ordinary random errors the code must also overcome . However , large 
deterministic fluctuations in the number of gapped symbols from codeword to codeword 
may occur for certain combinations of code parameters , gap cycle parameters , and data 
rates. In this article , several mechanisms for causing these fluctuations are identified and 
analyzed. 

Fortunately , the Voyager-VLA parameters do not produce wild fluctuations in the 
number of gapped symbols from codeword to codeword. The result is graceful degrada- 
tion of concatenated code performance due to the VLA gaps ; even for a VLA stand- 
alone system. The magnitude of the deterioration at a constant concatenated code bit 
error rate of 10~ 5 is 0.5 dB to 0.6 dB for a VLA stand-alone system and 0.3 dB to 0.4 dB 
for the VLA arrayed equally with Goldstone. 

Even though graceful degradation is predicted for the Voyager-VLA parameters ; cata- 
strophic degradation greater than 2 dB can occur for a VLA stand-alone system at certain 
non-Voyager data rates inside the range of the actual Voyager rates . Thus, it is imperative 
that all of the Voyager- VLA parameters be very accurately known and precisely controlled. 
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I. Introduction 

Current plans [1] for supporting Voyager’s encounter at 
Neptune include the arraying of the DSN antennas at Goldstone, 
California, with the National Radio Astronomy Observatory’s 
Very Large Array (VLA) in New Mexico. The fully arrayed 
VLA operating in a stand-alone mode potentially provides 
about the same receiving capability as the Goldstone complex. 
The VLA arrayed with Goldstone would seem to offer up to 
two times greater data rates than Goldstone alone. 

Not designed as a communications antenna, the VLA’s 
signal transmission facility unfortunately suffers a disadvan- 
tage in that the received signal is subjected to a “gap” or 
blackout period of approximately 1 .6 msec once every control 
cycle. During the blackout period, the received signal is not 
transmitted from the antennas to the processing facility. The 
control cycle is 5/96 sec (approximately 52 msec), so the 
blackout period constitutes about 3% of the total receiving 
time. 

If the VLA were used in a stand-alone mode to receive 
uncoded data, the data received during the gaps would be 
irretrievably lost. The resulting bit error rate, averaged over 
gapped and ungapped periods, could be no better than about 
1.5%, even if no errors occurred outside the gaps. Arraying the 
VLA with Goldstone provides some capability during the 
gapped periods, but the overall error rate is still at least 3% of 
the error rate that would prevail based on the Goldstone-only 
aperture without any assistance from the VLA. 

The high raw error rates during the gaps can potentially be 
overcome by coding the data. All of Voyager’s telemetry data 
is convolutionally encoded, and the memory and error correc- 
tion capability of the convolutional code provides a mecha- 
nism for bridging small gaps in the data. Unfortunately, the 
convolutional code’s correction capability is limited to approx- 
imately the memory length of the code, and the VLA gaps are 
longer than the Voyager code’s memory length (6 bits) for 
data rates greater than 3.75 kbits/sec. 

Voyager’s compressed imaging data is Reed-Solomon en- 
coded in addition to being convolutionally encoded. Each 
Reed-Solomon codeword consists of 255 eight-bit symbols, 
and blocks of four codewords are interleaved symbol by sym- 
bol. Thus, more than 8000 data bits are transmitted between 
the beginning and end of a codeword. At Voyager-Neptune 
data rates of 21.6 kbits/sec or lower, each Reed-Solomon 
codeword is decoded based on symbols accumulated over a 
minimum of seven or eight complete gap cycles, so the Reed- 
Solomon decoder tends to see an average mix of gapped and 
ungapped symbols. Since Voyager’s Reed-Solomon code can 
correct about 6% erroneous symbols, the code can potentially 


withstand 3% gapped symbols with a reserve error correction 
capability of 3% to handle normal ungapped symbol errors. 


II. Previous Analysis 

The general conclusions based on the simple reasoning in 
Section I are largely valid, but more detailed analysis is neces- 
sary to quantify the deleterious effects of the data gaps and 
to detect anomalous situations when the “average” behavior 
is not a valid determinant of overall performance. An analysis 
of the effects of the VLA data gaps on Voyager’s convolu- 
tionally coded and concatenated coded data was reported 
years ago [2] , [3] when the possible use of the VLA for 
Voyager was first foreseen. The earlier analysis first examined 
the effects of the data gaps on convolutionally coded data via 
a full software simulation of the Viterbi decoder that accu- 
rately modeled the VLA gap cycle. Then the average Viterbi 
decoder error rates predicted by the simulation were used as 
the basis for a theoretical calculation of the performance of 
the convolutional/Reed-Solomon concatenated code. 

The intuitive conclusions about the performance of a 
stand-alone VLA receiving convolutionally coded data were 
borne out by the simulations. As shown in Fig. 4 of [3] , the 
decoded error rate decreases slowly as a function of signal-to- 
noise ratio and then flattens out at an unacceptable value 
around 1%. The exact limiting error rate is a function of the 
data rate and the duty cycle of the gap period, and it approaches 
1.5% for high data rates and the VLA duty cycle of 3% gaps. 
When the VLA is arrayed equally with Goldstone, so that the 
overall signal-to-noise ratio drops by only 3 dB during the 
gaps, the error rate curve retains its usual character and does not 
approach a saturation value over the interesting range of error 
rates. 

The earlier theoretical calculations of the Reed-Solomon 
code’s performance in Fig. 9 of [3] show the same leveling off 
of error rate as a function of signal-to-noise ratio when the 
VLA is unassisted by Goldstone. This implies unacceptable 
performance for a VLA stand-alone system, no matter how 
high the signal-to-noise ratio at the VLA. The Reed-Solomon 
performance deterioration when the VLA’s signal is augmented 
by an equal signal from Goldstone is not so dramatic, as the 
error rate curves again retain their usual character but drop off 
more slowly. 

The earlier analysis reached sharply different conclusions 
about VLA performance with and without assistance from 
Goldstone. Because of the predicted potential for devastating 
degradation, the causes of which were not fully understood, 
the VLA gap analysis was reopened in order to pin down the 
precise error mechanisms before Voyager’s Neptune encounter 
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next year. It was not possible to improve on the earlier anal- 
ysis of the convolutional code’s performance, because the 
simulation accurately modeled both the Viterbi decoder and 
the VLA gap cycle. However, a somewhat more detailed 
analysis of the Reed-Solomon code’s performance was under- 
taken, and this new analysis is reported here. 

The new analysis of the performance of concatenated 
coding yields less pessimistic conclusions about the effect of 
the data gaps in a VLA stand-alone system. The regularity of 
the gap cycle helps to eliminate the possibility of larger than 
average numbers of errors due to the gaps. On the average, the 
Reed-Solomon code’s error correction capability can take 
care of errors during the gaps, and this average behavior is 
overwhelmingly likely to occur for most combinations of 
system parameters. On the other hand, the new analysis re- 
veals that certain combinations of parameters are taboo if the 
type of ruinous degradation predicted by the old analysis is 
to be avoided. 


III. Various Possible Analytical Approaches 

There are several possible approaches to calculating the 
theoretical performance of the Reed-Solomon code in the 
presence of data gaps. Three such approaches are shown in 
Fig. 1 . The simplest such approach, called the one -level model, 
was the approach used in the earlier analysis. The most com- 
plex and accurate approach, the simulated error stream model, 
is not feasible. The middle approach, the two-level model, is 
the one taken in the current analysis. 

The single-level model is based on the following expression 
(cf. Eq. 2 of [2] ) for calculating the concatenated code’s bit 
error probability: 
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Equation (1) expresses the bit error probability^ of the con- 
catenated channel in terms of the bit error rate p and the 
Reed-Solomon symbol error rate 7r of the output from the 
Viterbi decoder. This expression assumes an (A, A - 2 E) 
Reed-Solomon code, which can correct up to if symbol errors 
per A-symbol codeword. In [2] and [3] , the average error 
probabilities p and i r characterizing the Viterbi decoder output 
were obtained by means of a detailed simulation of the Viterbi 
decoding process, including an accurate model of the signal-to- 
noise ratio fluctuations over the VLA gap cycle. The model 
based on Eq. (1) is termed the “single-level” model, because 
the Reed-Solomon error probability is calculated using single 
overall average values of p and tt to characterize the Viterbi 


decoder behavior, without regard to deterministic fluctuations 
in p and tt between the gapped and ungapped portions of a gap 
cycle. 


The validity of Eq. (1) rests on the assumptions that suc- 
cessive Reed-Solomon symbol errors are independent and 
identically distributed. As stated in [2 ] , independence of sym- 
bol errors is a good assumption for Voyager because Voyager’s 
8-bit Reed-Solomon symbols are interleaved to depth 4 and 
Viterbi decoder error bursts of 32 or more bits are highly 
unlikely for the (7, 1/2) convolutional code. On the other 
hand, the assumption of identically distributed symbol errors 
throughout a Reed-Solomon codeword should be altered to 
account for the deterministic periodicity of the gap cycles. 
Symbols occurring during the gaps have a higher error rate 
than symbols occurring outside the gaps. 

Ideally, a set of A values of p and tt should be calculated 
from the Viterbi decoder simulation for each possible starting 
“phase” of the gap cycle relative to the Reed-Solomon code- 
word boundaries. Equation (1) can be easily modified to allow 
the values of p and tt to vary symbol by symbol throughout 
the codeword. The error rate calculated from this “multilevel” 
model can then be averaged over all possible relative phases of 
the gap cycle to obtain the overall average bit error rate. 

The analysis in the present article is not based on this gen- 
eral multilevel model for the Viterbi decoder’s output statistics. 
Rather, it assumes that two levels will suffice: one set of values 
for p and 7r during the ungapped portion of the gap cycle and 
another set of values during the gaps. Intuitively, the two-level 
model should become exact in the limit of very high data 
rates, as the widths of both the gapped and ungapped periods 
become long with respect to the memory length of the convo- 
lutional code. In this limit, the Viterbi decoder has a chance 
to settle into steady-state values of p and 7r both inside and 
outside the gaps, and the number of “transition” bits and sym- 
bols characterized by intermediate values of p and 7r is small 
relative to the number of bits and symbols characterized by 
the steady-state gapped and ungapped values. 

Another form of the multilevel model consists of dispens- 
ing with the theoretical calculations altogether and instead 
feeding the simulated output of the Viterbi decoder directly 
into a simulation of the Reed-Solomon code’s performance. 
Tests of this type are impractical because of the monumental 
amount of simulated data that must be collected before 
statistical confidence in the results can be obtained. For 
example, at an operating point of 10~ 4 Reed-Solomon code- 
word error probability (corresponding to a concatenated 
code bit error probability of about 3 X 10” 6 ), the average 
waiting time for each erroneous codeword is about 20 million 
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bits. Simulating enough Viterbi decoded data to produce a 
statistically valid sample of erroneous codewords was not 
feasible. However, similar end-to-end tests of the gapped VLA 
data have been performed using real test data from CTA-21. 1 


IV. Details of the Two-Level Model 

At Voyager’s data rates, the length of the ungapped portion 
of each gap cycle is several hundred to more than a thousand 
bits long, so the steady-state assumption on which the two- 
level model is based appears justified for the ungapped zone. 
The length of the gaps, however, is at most around 35 bits (or 
about 6 memory lengths of the convolutional code) at Voy- 
ager’s highest data rate of 21 .6 kbits/ sec. Thus the accuracy of 
the two-level model is somewhat questionable for the gapped 
zone. However, it should still give a better prediction of con- 
catenated code performance than the single-level model. 

The two-level model used in this article is a model for the 
decoded output of the Viterbi decoder. The Viterbi output 
bit error rate is allowed to vary between two levels. The two 
corresponding types of errors are referred to as “gapped” 
errors and “ungapped” errors, respectively. Each Viterbi de- 
coded bit is characterized by one of two bit error probabili- 
ties p 0 or Pj, and each Reed-Solomon symbol is characterized 
by one of two symbol error probabilities 7r 0 or 71 ^ . Gapped 
bits and symbols have error probabilities p Q and 7r 0 , and un- 
gapped bits and symbols have error probabilities p x and tt x . 

The ungapped error probabilities p x and 7Tj are assumed to 
be the steady-state Viterbi decoder output error probabilities 
for a decoder operating at the ungapped signal-to-noise ratio 
E h /N 0 , and the gapped error probabilities p 0 and 7r 0 are as- 
sumed to be the corresponding error probabilities for a decoder 
operating at the reduced signal-to-noise ratio inside the gap. 
The signal-to-noise ratio inside the gap is zero for the VLA 
stand-alone system, and for the VLA arrayed with Goldstone 
it is reduced from the signal-to-noise ratio outside the gap by 
an amount reflecting the array ratio. For equal contributions 
from the VLA and Goldstone, the signal-to-noise ratio reduc- 
tion inside the gaps is 3 dB. Other array ratios considered in 
this article correspond to gap reductions of 1.5 dB and 5 dB. 

A summary of the two-level model for the cases of a VLA 
stand-alone system and the VLA arrayed with Goldstone is 


J M. Varuna, “VLA Standalone Test Results for 3.6 KBPS and 7.2 KBPS 
Voyager Telemetry Data Rates," Interoffice Memorandum Voyager- 
GDSE-8 7-056, Jet Propulsion Laboratory, Pasadena, California, 
September 16, 1987. 


shown in Table 1. Performance curves showing the relation- 
ship between the bit and symbol error probabilities and the 
signal-to-noise ratio for the Voyager code parameters are 
shown in Fig. 2 2 . The curves in this figure are taken from 
Fig. 3-l(a) of [4] , together with the results of some new 
simulations of the Voyager code for signal-to-noise ratios 
E b /N 0 near 0 dB or lower. The Viterbi decoder’s performance 
in this normally uninteresting range of E b /N 0 is relevant for 
the VLA gap analysis, because signal-to-noise ratios inside the 
gap can be 0 dB or lower whenever signal-to-noise ratios out- 
side the gap are near the normal operating point of the decoder. 

The classification of bits and symbols as “gapped” or “un- 
gapped” is not very precise. To account for the error correc- 
tion capability of the Viterbi decoder, some of the bits decoded 
during the gap period should be considered to be effectively 
the same as ungapped bits. By an argument presented in [2] , 
the Viterbi decoder can correct exactly A" - 1 of the gapped 
bits when the signal-to-noise ratio is zero inside the gaps and 
infinite outside the gaps. Here, K is the constraint length and 
K - 1 is the memory length of the convolutional code (K = 7 
for the Voyager code). In this limiting case, the effect of the 
convolutional code is equivalent to converting K - 1 gapped 
bits (characterized by the signal-to-noise ratio inside the gap) 
into K - 1 ungapped bits (characterized by the signal-to-noise 
ratio outside the gap). In this article, it is assumed that the 
convolutional code effectively accomplishes this same conver- 
sion of K - 1 gapped bits into K - 1 ungapped bits, even 
though the signal-to-noise ratios of interest are not exactly 
zero inside the gaps or infinite outside the gaps. 

Blocks of / consecutive bits are grouped to form symbols 
for the Reed-Solomon code (/ = 8 for the Voyager code). A 
/-bit Reed-Solomon symbol is in error if any one of its bits is 
incorrect. Therefore, it is appropriate to classify a Reed- 
Solomon symbol as a gapped symbol if at least one of its / 
component bits is a gapped bit. Because one gapped bit can 
cause a whole /-bit symbol to be classified as a gapped symbol, 
a block of consecutive gapped bits is effectively lengthened by 
an average of /- 1 bits for the purpose of calculating the 
number of gapped symbols. In other words, a block of B con- 
secutive gapped bits corresponds, on the average, to (B+J- 1 )// 
gapped symbols. This effect will be analyzed more closely in 
Section VII. 


2 The Viterbi decoder error probabilities in Fig. 2 are plotted versus 
the signal-to-noise ratio E^Nq for the convolutional code only. In 
this figure, E b represents the signal energy per convolutionally en- 
coded bit. In Figs. 3 through 6 and 14 through 19, which show con- 
catenated code error probabilities, the E b jN 0 axis represents the 
signal-to-noise ratio for the concatenated system, i.e., E b is the signal 
energy per Reed-Solomon encoded information bit. 
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The actual length of the gaps is effectively reduced by 
about K - 1 bits due to the error correction capability of the 
Viterbi decoder, but then increased by an average of/ - 1 bits 
by the ability of one bad bit to knock out an entire symbol. 
The net adjustment, J -K, equals just one bit for the Voyager 
code parameters (/ - 8, K = 7), so any reasoning based on the 
physical gap length rather than the effective gap length is 
probably valid for the Voyager case. However, the model used 
in this article will keep track of these two compensating effects 
separately, so it can be applicable to combinations of A” and/ 
which might not cancel each other so neatly. 


Since the two-level model is a model for the Viterbi decoder 
output statistics, it can be tested against the results of the de- 
tailed simulations conducted in [2] and [3] . The test can 
check whether the overall Viterbi decoder error statistics 
(averaged over gapped and ungapped periods) predicted from 
the simulations match the average of the two levels of statis- 
tics used in the two-level model. The test cannot directly 
check whether the two-level model is adequate for the pur- 
poses of calculating concatenated code performance, since 
detailed Reed-Solomon code performance simulations coupled 
with the Viterbi decoder simulations are not available as a 
benchmark. A corroboration of the two-level model at the 
level of the Viterbi decoded output is reported in the Appendix. 


V. Code Parameters, Gap Cycle Parameters, 
and Data Rates 

In order to apply the two-level model to the calculation of 
the effects of the VLA gaps on concatenated code error rates, 
several additional parameters need to be defined. The con- 
straint length K or memory length K - 1 of the convolutional 
code and the symbol size / of the Reed-Solomon code's sym- 
bols have already been discussed in the description of the basic 
model. Other code parameters that affect the concatenated 
code performance are the Reed-Solomon code’s word length 
N, its error correction capability E , and its interleaving depth 
I. Essential parameters of the VLA gap cycle are the gap length 
G and the total length of the gap cycle T. The final parameter 
that influences the model’s prediction of concatenated code 
performance is the data rate R. In this article, R is defined as 
the Viterbi decoder output bit rate. The corresponding chan- 
nel symbol rate is 2 R for Voyager’s rate 1/2 convolutional 
code. The redundant nondata bits inserted by the Reed- 
Solomon code are counted toward the data rate R as defined 
here. 

Table 2 lists the values of these essential parameters for the 
Voyager-VLA configuration. 


VI. Conditional Concatenated Code Error 
Probabilities 

The Reed-Solomon decoder error probability depends on 
the two symbol error probabilities tt 0 , 7Tj, and also on the 
number of symbols of both types within a Reed-Solomon 
codeword. Let n 0 denote the number of “gapped” symbols 
with error probability 7r 0 , and n x the number of “ungapped” 
symbols with error probability 7r r The total Reed-Solomon 
word length is N= n Q + n l . The number of gapped symbols 
n 0 depends on the data rate R, the gap length G and gap 
period 7\ the Reed-Solomon symbol length / and codeword 
length N, the convolutional code constraint length K , and the 
“phase” of the gap cycle relative to the Reed-Solomon code- 
word boundaries. 


The symbol error probability for the output of the Reed- 
Solomon decoder can be evaluated from a generalization of 
Eq. (1) that accounts for the two input symbol error proba- 
bility levels. The answer also depends on whether it is evalu- 
ated for a gapped symbol or an ungapped symbol. If P sQ and 
P sl denote the output gapped and ungapped symbol error 
probabilities, respectively, then 


so 
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The corresponding output bit error probabilities P b0 andP bl 
are obtained by multiplying these expressions by the condi- 
tional probability of a bit error, given a symbol error, 


P 


b 0 



(4) 


P 


b 1 



(5) 


where p 0 and p 1 are the Viterbi decoder output bit error 
probabilities for gapped and ungapped bits, respectively. 


The overall average symbol and bit error rates P s and P b 
output from the Reed-Solomon decoder are obtained by aver- 
aging the expressions for P sQi P gl and P b0 ,P bl over the n 0 
gapped symbols and the n x ungapped symbols. 
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ratio and totally random steady-state Viterbi decoded bits. 
^ ' The gapped bit and symbol error probabilities are p 0 = 1/2 and 
7r 0 = (2 J - l)/2 / , where J is the number of bits comprising a 
symbol. Since / = 8 for the Voyager Reed-Solomon code, a 
(7) good simplifying approximation is 7r 0 1 . Substituting 

7r 0 = 1 into Eqs. (2) and (3) leads to 


The bit and symbol error probability formulas given in 
Eqs. (2) through (7) can be regarded as expressions for the 
conditional bit error probability, given knowledge of the num- 
ber of gapped symbols n 0 in a codeword. The computation of 
this conditional error probability is a convenient intermediate 
step toward the eventual computation of the unconditional 
error probability, because it separates the performance evalua- 
tion into two reasonably distinct parts. The first part shows 
how sensitive the performance is to variations in the number 
of gapped symbols per codeword, but it can be analyzed with- 
out reference to any peculiarities of the gapping mechanism 
which might cause the number of gapped symbols to vary 
from codeword to codeword. The evaluation of the condi- 
tional bit error probability can be performed independently 
of many of the parameters in the problem, including the data 
rate R , the gap length G, the gap cycle T , and the code’s 
interleaving depth /. The second step in the overall perfor- 
mance evaluation is to evaluate the interplay of these remain- 
ing parameters in determining the critical number of gapped 
symbols per codeword. 

This separability of the overall problem, with just one 
crucial parameter serving to link the two parts of the analysis, 
is a big advantage in favor of the two-level model relative to 
multilevel models or a full-scale combined simulation of the 
Viterbi decoder and Reed-Solomon performance. Whatever 
exactness the simpler two-level model might lack relative to 
multilevel models is compensated by increased insight into what 
effect each of the parameters has on the overall performance. 

Figures 3 through 6 show the evaluation of the Reed- 
Solomon conditional bit error probability for various values of 
the parameter n 0 . Figure 3 shows the performance curves for 
a stand-alone VLA system, and the next three figures apply to 
a VLA-Goldstone array with varying contributions from each 
component of the array. The curves in each figure were evalu- 
ated using values of gapped and ungapped bit and symbol 
error probabilities, p 0 , 7r 0 , p x , 7Ti , obtained from the baseline 
steady-state Viterbi decoder performance curves in Fig. 2. The 
Reed-Solomon code parameters N and E were fixed at the 
Voyager code’s characteristics, TV = 255 andii = 16. 

It is instructive to examine the curves for the VLA stand- 
alone case. When the VLA is unassisted by Goldstone, the 
received data signals are totally lost during the time of the 
VLA gap. A total gap is characterized by zero signal-to-noise 



( 9 ) 


as long as n 0 is not greater than E. The error probability for- 
mula of Eq. (9) for the ungapped symbols is the same equation 
as for the symbol error probability under a one-level model for 
a code of blocklength N ~ n Q capable of correcting E - n Q 
errors. Thus, the effect of a total gap on the ungapped symbols 
is simply to use up some of the error-correcting capability of 
the Reed-Solomon code. The performance degradation in the 
ungapped zone due to the gap is equivalent to the degradation 
that would result from substituting a less powerful code. 
Figure 3 effectively shows how well a series of less and less 
powerful codes performs relative to the baseline performance 
of the Voyager code (corresponding to the curve in Fig. 3 
labeled n Q = 0). The performance curves deteriorate very rapidly 
as n Q approaches 16, which represents a blocklength 239 code 
with no error correction capability. Values of n Q greater than 
16 would completely overwhelm the code. 

The notion of an “equivalent” reduced-redundancy code 
for determining performance in the case of a total gap is useful 
but not completely accurate. The concatenated code’s overall 
error statistics depend on the statistics for both ungapped sym- 
bols and gapped symbols. The error probability formula of 
Eq. (8) for gapped symbols equals the “equivalent” reduced- 
redundancy code’s word error probability, which is greater 
than its symbol error probability. Thus, the performance 
degradation in the gapped zone due to the gap is worse than 
the degradation resulting from substituting the less powerful 
code. The overall average symbol error probability P s is ob- 
tained by averaging the results of Eqs. (8) and (9) over gapped 
and ungapped symbols, as in Eq. (6), and hence is somewhat 
higher than the symbol error probability of the intuitively 
“equivalent” reduced-redundancy code. A similar conclusion 
holds for the overall average concatenated code bit error 
probability P b , obtained from Eqs. (8) and (9) via Eqs. (4), 
(5), and (7). 
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VII. The Number of Gapped Symbols per 
Codeword 

The number of bits decoded by the Viterbi decoder during 
each gap period is RG. The total number of Viterbi decoded 
bits in an entire gap cycle is RT. This corresponds to RG/r 
channel symbols that are received during gaps, out of RT/r 
channel symbols received every gap cycle if the convolutional 
code’s rate is r (r = 1/2 for the Voyager code). Under the 
two-level model, approximately K - 1 (the memory length of 
the convolutional code) of the decoded bits during the gap 
period can be treated as having the same signal-to-noise ratio as 
ungapped bits. Thus, the effective length of each gap is reduced 
from RG bits to approximately RG - K + 1 bits due to the 
correction capability of the Viterbi decoder. On the other 
hand, the gap period is effectively lengthened by J - 1 bits, on 
the average, due to the ability of just one incorrect bit to cor- 
rupt an entire /-bit Reed-Solomon symbol. Thus, under the 
two-level model, the average number of gapped symbols in 
each A-symbol codeword is N(RG - K + f)/RT. This works 
out to an average of about eight gapped symbols per 255- 
symbol codeword for the Voyager-VLA parameters listed in 
Table 2. 

The periodicity of the gap cycle tends to guarantee that 
every Reed-Solomon codeword receives an average number of 
gapped symbols. However, there are certain conditions under 
which this conclusion is invalid. Some codewords can get more 
than their share of gapped symbols, while other codewords 
receive fewer. The codewords receiving too many gapped 
symbols are drastically more prone to error, as indicated by 
the rapid deterioration in the performance curves in Figs. 3 
to 6 as the number of gapped symbols n Q is increased. Thus, 
a small number of such atypical codewords can dominate the 
overall error performance of the concatenated code. 


A. Fluctuations Due to Symbol Edge Effects 

One basic mechanism causing an uneven distribution of 
gapped symbols per codeword is the symbol edge effects that 
on the average lengthen the effective gap by / - 1 bits. The 
actual lengthening of the gap can vary from 0 bits to 2/ - 2 
bits, depending on the “phase” of the gap edges relative to 
symbol boundaries. Figure 7 shows that each gap is length- 
ened by exactly (0j + 0 2 ) bits, where <f> x and 0 2 are the 
phases of the left and right edges of the effective gap relative 
to Reed-Solomon symbol boundaries. Both of these phases 
are uniformly distributed from 0 to / - 1 bits for a codeword 
picked at random. However, the two phases are not indepen- 
dent of each other, and in fact they must satisfy [(0j + 0 2 ) 
+ (RG - K + 1)] mod / = 0. Despite this dependence, the 
average of (0j + 0 2 ) is always / - 1 bits, because the aver- 


age of a sum of random variables equals the sum of the aver- 
ages even when the random variables are correlated. In 
general, (0 2 + 0 2 ) can assume either of two values, except that 
when (RG - K + 1) mod / = 1 , it must equal / - 1 bits regard- 
less of where the symbol boundaries fall relative to the gap 
edges. When (RG - K + 1) mod / ^ 1 , the two possible values 
for (0j + 0 2 ) are separated by exactly / bits. Thus, the actual 
lengthening of the gap due to symbol edge effects can be one 
of two values which differ by one symbol. 

For example, when (RG - K + 1) mod / = 0, the two possi- 
ble values for (0 t + 0 2 ) are 0 bits and / bits, and when 
(RG - K + 1) mod / = 2, the two possible values are / - 2 bits 
and 2/ - 2 bits. In the first example, (<p x + 0 2 ) = 0 with 
probability 1// and (<p l + 0 2 ) = / with probability 1 - 1//. In 
the second example, (0 5 + 0 2 ) - /- 2 with probability 1-1// 
and (0j + 0 2 ) = 2 / -2 with probability 1//. In both cases, the 
average value of (0j + 0 2 ) is / - 1 bits. However, the condi- 
tions in the second example will cause slightly poorer concate- 
nated code performance, because the worst-case lengthening of 
the gap is / - 1 bits greater than the average lengthening. On 
the other hand, the performance curves for the optimal data 
rates, which result in (RG - K + 1) mod / = 1, will suffer no 
additional degradation beyond that due to the average length- 
ening of the gaps. 

The additional degradation due to fluctuations in the length- 
ening of the gaps as a result of symbol edge effects is depicted 
in Fig. 8 as a function of the data rate R. In this figure the 
extra degradation is measured in terms of the worst-case num- 
ber of gapped symbols per gap relative to the average number. 
The worst-case number of gapped symbols per gap varies 
periodically with the data rate between a minimum value of 
(RG - K + /)// gapped symbols and a maximum value of 
(RG - K + /)// +1-1// gapped symbols. The period of 
this variation is J\G. For the Voyager-VLA parameters, the 
worst data rates occur nominally at 5 kbits/ sec, 10 kbits/sec, 
15 kbits/ sec, . . . , and the best data rates occur nominally at 
4.375 kbits/sec, 9.375 kbits/sec, 14.375 kbits/sec, .... The 
exact locations of the best data rates or the worst data rates 
are determined under the two-level model not only by the 
precisely measurable value of G, but also by the assumed 
effective shortening of each gap by exactly K - 1 bits due to 
the error correction capability of the convolutional code. 
Since the effective shortening of the gap is a fuzzy quantity, 
the absolute locations of the best or worst data rates cannot be 
determined precisely. Fortunately, the variation between the 
best and worst data rates is relatively small, because it is 
equivalent to creating less than one additional gapped symbol 
per gap. Other mechanisms causing fluctuations in the number 
of gapped symbols per codeword can cause much larger effects 
on performance. 
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B. Fluctuations Due to Incomplete Gap Cycles 
per Codeword 

A second mechanism that could cause some codewords to 
have an atypically large number of gapped symbols occurs 
when the total span of one block of interleaved codewords en- 
compasses one more gapped section of data than another 
block of interleaved codewords. One block of / interleaved 
codewords, each consisting of N J- bit symbols, spans a con- 
tinuous section of NIJ bits. If NIJ is an exact integer multiple 
of the gap cycle RT , than all interleaved codeword sets will 
include gapped symbols from exactly NIJ/RT different gap 
cycles. However, if NIJ/RT is not an integer, the span of an 
interleaved set of codewords will include a number of com- 
plete gap cycles plus a fraction of a cycle. If the extra frac- 
tional cycle includes the gap, the interleaved codeword set is 
“unlucky” and will suffer degraded performance, because its 
overall fraction of gapped symbols is larger than the nominal 
value of (RG - K + f)/RT. Other interleaved codeword sets 
are “lucky” and avoid the gap altogether in the fractional 
gap cycle, resulting in better performance than the nominal 
prediction. However, the overall unconditional concatenated 
code performance is dominated by the performance of the 
unlucky codeword sets, and so it is important to quantify 
how unlucky they can be. 

Figure 9 illustrates the effects of incomplete gap cycles 
per interleaved codeword block. The upper picture shows the 
case of an unlucky codeword set with a fractional gap cycle 
that includes a gap, and the lower picture shows a lucky 
codeword set whose fractional gap cycle misses the gap. In 
general, an “average” interleaved codeword block includes 
NIJ/RT gapped sections of data, but a lucky codeword block 
includes only L NIJ/RT J, while an unlucky codeword block 
includes L NIJ/RT J + 1, where Lx J represents the largest 
integer not exceeding x . Each gapped section of data includes, 
on the average, (RG - K + J)/J gapped symbols, which are 
distributed over the I interleaved codewords. The difference 
between the lucky codeword blocks and the unlucky ones is 
(RG - K + J)/IJ gapped symbols per codeword per gap. The 
difference between an unlucky codeword block and an aver- 
age one lies linearly (as a function of data rate) between 0 
and (RG - K + J)/IJ gapped symbols per codeword per gap, 
depending on how close NIJ/RT is to L NIJ/RT J or to 
L NIJ/RT J + 1. 

Figure 10 plots the peak-to-average concentration of gapped 
symbols due to incomplete gap cycles versus the data rate. The 
peak-to-average concentration of gapped symbols in the un- 
lucky codewords varies between 1 and 1 + RT /NIJ as NIJ/RT 
varies between successive integer values. The concentration 
factor returns to 1 periodically at reciprocal data rates sepa- 
rated by T/NIJ , but it rises to increasing maximum values 


between its returns to 1. For the Voyager-VLA parameters, 
the maximum value of RT/NIJ is less than 1/7 even at the 
maximum Voyager-Neptune data rate of 21.6 kbits/sec. Thus, 
the overall magnitude of the degradation caused by incom- 
plete gap cycles is limited to about one additional gapped 
symbol per codeword for Voyager. However, the effect of 
incomplete gap cycles can be very severe at higher data rates, 
namely data rates approaching NIJ/T (= 157 kbits/sec) or 
higher. 

C. Fluctuations Due to Gap Cycle/Interleaving 
Cycle “Resonances” 

A third mechanism that may cause an atypical concentra- 
tion of gapped symbols in some codewords is possible “reso- 
nances” between the gap cycle and the interleaving cycle. 
Even if every interleaved codeword block were to experience 
exactly the same proportion of gapped and ungapped periods, 
there can be a worst-case codeword within the interleaved 
block which gets more than its share of gaps. In the worst 
conceivable case, one unlucky codeword might receive all of 
the interleaved block’s gapped symbols, while the other / - 1 
codewords escape with no gapped symbols at all. This would 
result in a worst-case peak-to-average concentration of gapped 
symbols in one unlucky codeword by a factor of I. 

Some potential situations that may cause a concentra- 
tion of gapped symbols in one unlucky codeword are illus- 
trated in Fig. 11. In Fig. 11(a), the average effective gap 
period RG - K + J is small enough to fit within one symbol 
period /, and the distance between successive gap periods (the 
gap cycle RT*) is exactly an integer multiple of the interleaving 
cycle IJ. In this case, whichever of the / interleaved codewords 
includes the gap in its first symbol will also include all of the 
gaps contained within the entire span (NIJ bits) of the inter- 
leaved block. This unlucky codeword will receive / times its 
average share of gapped symbols, and the remaining/ - 1 code- 
words will have only ungapped symbols. 

Figure 11(b) illustrates a slightly different situation in 
which the effective gap period is still small, but the gap cycle 
RT is increased by enough to retard the occurrence of succes- 
sive gaps by one symbol. In this case, successive gaps hit con- 
secutive codewords, and all / codewords within the interleaved 
block receive a proportionate share of gapped symbols. Fig- 
ure 11(c) illustrates a case in which the gap cycle RT is made 
slightly longer, such that it retards the occurrence of succes- 
sive gaps by two symbols instead of one. Now, if the inter- 
leaving depth / is an even number, half of the interleaved code- 
words will get all of the gapped symbols, and the unlucky 
codewords will experience a peak-to-average concentration 
factor of 2. On the other hand, if the interleaving depth / is 
odd, the gaps will be distributed over all of the codewords in 
the interleaved block. 
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Figure 11(a) identifies a series of data rates R which 
cause major resonances between the gap cycle and the inter- 
leaving cycle. Specifically, the major resonances occur for 
values of R satisfying RT mod IJ = 0. Figure 1 1(c) shows that 
minor resonances can also occur when RT mod IJ =£ 0, if 
(RT mod //)// and / contain a common integer factor. At the 
major resonances, the peak-to-average concentration factor is 
/, while at the minor resonances the concentration factor is 
equal to the common integer factor of I and (RT mod//)//. 

The preceding conclusions about the magnitude of the 
peak-to-average concentration factor at the major and minor 
resonances are valid only for the types of cases depicted in 
Fig. 11, for which the gapped portion RG -K +J of the total 
gap cycle RT is small enough to fit within one codeword sym- 
bol. Two other possible cases are illustrated in Fig. 12. In 
Fig. 12(a), the effective gap length RG - K + / encompasses 
exactly two symbols, and the data rate is chosen to cause a 
major resonance between the gap cycle and the interleaving 
cycle (i.e., RT mod IJ = 0). In this case, the same two code- 
words are always hit by successive gaps, while the remaining 
1-2 codewords escape the gaps altogether. The resulting peak- 
to-average concentration factor in this case is 1/2. 

Figure 12(b) illustrates a situation in which the effective 
gap length (RG -K +/) is longer than one block of / inter- 
leaved symbols. The portion of the gap covering an integer 
multiple of IJ bits affects each of the I codewords equally, 
but the remaining portion of the gap covering a fraction 
of IJ bits afflicts one or more of the I codewords selectively. 
If the data rate is at a major resonance, the same codeword(s) 
will remain unlucky for all the gaps that occur throughout the 
entire span of the interleaved codeword set. The unlucky 
codeword(s) will receive an extra share of gapped symbols 
corresponding to the fractional portion of gapped bits, namely 
(RG - K + /) mod IJ. An average codeword should receive 
(RG - K + /)/// gapped symbols from each gap, but the lucky 
codewords receive only L (RG - K + /)/// J, while the un- 
lucky codewords receive L (RG - K + /)/// J + 1 (unless 
(RG - K + J) is exactly an integer multiple of//). The result- 
ing peak-to-average concentration of gapped symbols in the 
unlucky codewords varies between 1 and 1 + IJ/(RG - K + 1) 
as (RG - K + J) varies between successive integer multiples 
of IJ. 

The location of the major resonances depends on the length 
of the full gap cycle RT relative to the interleaving cycle //, 
while the magnitude of the performance deterioration at each 
major resonance depends on the effective length (RG - K + f) 
of the gapped portion of a gap cycle relative to the interleaving 
cycle IJ. These two effects are depicted separately in Fig. 13. 
The variation of the peak-to-average concentration of gapped 
symbols is shown as a smooth curve, while the resonance loca- 


tions are shown as sharp lines. The peak-to-average concentra- 
tions depicted by the smooth curve are valid only at or near 
the resonance locations. The resonances themselves are of non- 
zero width, but they are very narrow if the number of symbols 
N per codeword is large. 

Peak-to-average concentration factors of 1 .5 and greater are 
common at major resonances within the range of Voyager’s 
data rates. The maximum peak-to-average concentration factor 
varies between 4 and 2 for data rates between 4.375 and 
19.375 kbits/ sec. A peak-to-average concentration factor of 

1.5 corresponds to 12 gapped symbols per worst-case code- 
word rather than the average of 8. Figure 3 shows the poten- 
tial for catastrophic performance degradation of a VLA stand- 
alone system when the number of gapped symbols per code- 
word reaches 12 or 16 or higher. Thus, it is important that the 
precise Voyager data rates miss the location of the narrow 
resonances. Fortunately, this is the case. Table 3 lists the data 
rates causing major resonances in the range from 3.75 to 

22.5 kbits/ sec. Even though one of the Voyager data rates 
(21.6 kbits/sec) appears to fall perilously close to a major 
resonance, the unconditional performance evaluations in the 
next section show that the small separation is sufficient to 
avoid resonant degradation. 

VIII. Unconditional Concatenated Code 
Error Probability Curves for the 
Voyager-VLA Parameters 

The number of gapped symbols per codeword, n 0 , was eval- 
uated as a function of the relative phase between the gap cycle 
and the codeword, assuming the Voyager-VLA parameter 
values listed in Table 2. This evaluation simultaneously takes 
into account all of the types of fluctuations identified in the 
previous section. It was found that the Voyager data rates are 
all nonresonant, in the sense that the worst-case value of n 0 
was 9 or 10 and not 12, 16, or higher. 

The overall unconditional concatenated code bit error rate 
is obtained by averaging the bit error rate in Eq. (7), 

P b = E {P b } 00) 

where E { • } represents an average over the possible values of 
n 0 . Unconditional concatenated code bit error rates calculated 
from Eq. (10) are plotted in Figs. 14 through 17 for the same 
four VLA-Goldstone array ratios considered in Figs. 3 through 6. 

The concatenated code performance curves in Figs. 14 
through 17 are virtually identical for the three Voyager data 
rates shown, R = 7.2 kbits/sec, 14.4 kbits/sec, and 21 .6 kbits/ 
sec. Performance is very slightly improved at the higher rates. 
This negligible difference is attributable to the net effective 
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lengthening of the gap portion of the gap cycle by / - K = 1 
bit. This net lengthening constitutes a larger fraction of the 
total gap cycle at lower data rates than at higher rates, and 
hence the gap is predicted to affect the lower data rates 
slightly more adversely. However, as pointed out earlier, the 
net effective lengthening of the gap is the result of two com- 
pensating effects which almost cancel each other. The convo- 
lutional code's error correction capability makes the gaps look 
shorter at the low data rates, while edge effects due to entire 
Reed-Solomon symbols getting wiped out by one erroneous 
bit make the gaps look shorter at the high data rates. The 
model for both of these effects is fuzzy enough that the pre- 
dicted tiny performance improvement with increasing data 
rate is not significant. A more appropriate conclusion is that 
the predicted concatenated code performance is virtually inde- 
pendent of the Voyager data rate over the range R = 7.2 to 
21 .6 kbits/sec. 

The performance curves in Figs. 14 through 17 are essen- 
tially identical to the n Q = 9 conditional error rate curves in 
Figs. 3 through 6. This indicates that the unconditional error 
rate is almost completely determined by the error rate for 
codewords with the worst-case number of gapped symbols. At 
a constant performance level of 10~ 5 bit error rate, the net 
effect of the VLA gaps is to require 0.5 dB to 0.6 dB more 
signal-to-noise ratio for the VLA stand-alone system relative to 
an ungapped system. The net cost of the gaps when the VLA 
is arrayed equally with Goldstone (3-dB gaps) is 0.3 dB to 
0.4 dB. When the VLA’s contribution to the array is about 
twice Goldstone’s (5-dB gaps), the net cost of the gaps is 
almost the same as for the VLA stand-alone system, 0.5 dB. 
When Goldstone’s contribution is about twice the VLA’s 
(1.5-dB gaps), the net cost of the gaps shrinks to 0.1 dB to 
0.2 dB. 

The modest amount of deterioration in the concatenated 
code performance due to the VLA gaps is a consequence of 
the reserve error correction capability of the Reed-Solomon 
code relative to the average number of gapped symbols, cou- 
pled with the fortuitous choice of nonresonant data rates for 
Voyager. A remarkable example of catastrophic performance 
degradation due to a resonant data rate is shown in Fig. 18. 
The resonant data rate, R = 21.504 kbits/sec, differs from one 
of the important Voyager rates by less than 0.5%, and yet this 
case suffers an additional performance degradation of around 
2 dB for the VLA stand-alone system. The explanation is that 
21 .504 kbits/sec is a resonant data rate with a worst-case num- 
ber of gapped symbols per codeword equal to 16 (see Table 3), 
which exhausts the error correction capacity of the Reed- 
Solomon code. 

The effects of a resonant data rate are not so pronounced 
when the VLA is arrayed with Goldstone. Figure 19 shows 


concatenated code performance for the same resonant data 
rate considered in Fig. 18, but for the case of an equal VLA- 
Goldstone array ratio (3-dB gaps). The resonant data rate 
performance is only 0.1 dB to 0.2 dB worse than the perfor- 
mance for the nonresonant Voyager data rate at a required bit 
error probability of 10“ 5 . In fact, the concatenated code’s 
performance at high bit error rates (>10~ 3 ) is slightly better 
at the resonant rate than at the nonresonant rate. The reason 
for the dramatically improved performance is that, even 
though the Viterbi decoder’s error rate during the 3-dB gaps is 
truly bad, it does not decode completely random bits as it 
does when the gaps are totally devoid of received data. If the 
Viterbi decoder manages to decode a few gapped symbols 
correctly, every correctly decoded gapped symbol adds one 
symbol’s worth of reserve correction capacity to a Reed- 
Solomon decoder that would otherwise be operating with 
essentially no reserve capacity at all. 

The main lesson to be drawn from Figs. 14 through 19 is 
that the Voyager data rates and the VLA gap cycle parameters 
must be very accurately known and precisely controlled in 
order to avoid a disastrous resonance that would ruin the 
performance of a VLA stand-alone system. If this can be done, 
the overall concatenated code performance degradation due 
to the VLA gaps can be limited to about 0.5 to 0.6 dB. When 
the VLA and Goldstone are arrayed in equal ratio, the nominal 
degradation is reduced to 0.3 to 0.4 dB, but, just as signifi- 
cantly, the extra degradation at a resonant data rate is only a 
few more tenths of a dB. Thus, the necessity to avoid a reso- 
nant data rate is not quite so critical if the VLA is arrayed 
with Goldstone. 


IX. Summary 

Voyager’s Reed-Solomon outer code has sufficient error 
correction capacity to withstand the average number of 
erroneous symbols caused by the VLA data gaps. Of course, 
the code’s reserve capacity for correcting ordinary random 
errors not caused by the gaps is diminished and the overall 
concatenated code performance is slightly degraded relative 
to that of an ungapped receiving system. 

The periodicity of the VLA gap cycle tends to distribute 
an average number of gapped symbols to every Reed-Solomon 
codeword. However, several mechanisms were identified which 
can cause the actual number of gapped symbols to deviate 
from its benign average value for some unlucky codewords. 
These fluctuations are important because the overall perfor- 
mance of the concatenated code is dominated by its perfor- 
mance for the unluckiest codewords, which receive the worst- 
case number of gapped symbols. The mechanism causing the 
most serious fluctuations within the range of the Voyager- 
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VLA parameters is resonances between the VLA gap cycle 
and the Reed-Solomon codeword interleaving cycle. At many 
resonances within the range of Voyager’s data rates, the num- 
ber of gapped symbols included in unlucky codewords is 1.5 
to 4 times higher than the average number. The performance 
degradation at these resonant data rates can be catastrophic, 
especially for a VLA stand-alone system, as seen in Fig. 18. 


Fortunately, the resonances are very narrow and none of the 
actual Voyager data rates falls disastrously near a resonant 
rate. However, the existence of these catastrophic resonant 
rates inside the range of the actual Voyager data rates under- 
scores the importance of accurately knowing and precisely 
controlling all of the relevant code parameters, gap cycle 
parameters, and data rates for the Voyager-VLA system. 
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Table 1. The two-level model for the Viterbi decoder output statistics 



VLA-Goldstone array 

VLA stand-alone system 


Inside 
the gaps 

Outside 
the gaps 

Inside 
the gaps 

Outside 
the gaps 

Bit error rate 

p 0 

p i 

P 0 ~ 1/2 

Pi 

Symbol error rate 

*0 

"l 

n 0 * 1 

*1 

Signal-to -noise ratio 

pE b lN 0 * 


0 

E b IN 0 

* Values of the array ratio considered in this article 
p = 0 for the VLA stand-alone case). 

are: 10 log 1Q p = 

1.5 dB, 3 dB, 5 dB (as well as 


Table 3. Data rates causing major resonances between the gap 
cycle and the interleaving cycle 


Table 2. Code parameters, gap cycle parameters, and data rates 



General Case 

Voyager-VLA Case 

Convolutional code parameters 

Constraint length 

K 

7 

Memory length 

K- 1 

6 

Code rate 

r 

1/2 

Reed-Solomon code parameters 

Symbol size 

J 

8 

Codeword size 

N 

255 

Error correction capability 

E 

16 

Code rate 

1 - 2EIN 

223/255 

Interleaving depth 

I 

4 

Gap cycle parameters 

Gap length 

G 

1.6 msec 

Total gap cycle length 

T 

5/96 sec 

Data rate 

(Viterbi decoder bit rate) 

R 

21.6 kbits/sec 
14.4 kbits/sec 
7.2 kbits/sec 
3.6 kbits/sec 


Resonant data rate 
(kbits/sec) 

Worst-case number of 
gapped symbols 
per codeword 

4.3008 

37 

4.9152 

32 

5.5296 

29 

6.1440 

26 

6.7584 

24 

7.3728 

22 

7.9872 

20 

8.6016 

19 

9.2160 

17 

9.8304 

16 

10.4448 

15 

11.0592 

15 

11.6736 

14 

12.2880 

13 

12.9024 

13 

13.5168 

12 

14.1312 

12 

14.7456 

11 

15.3600 

11 

15.9744 

10 

16.5888 

10 

17.2032 

10 

17.8176 

9 

18.4320 

9 

19.0464 

18 

19.6608 

16 

20.2752 

16 

20.8896 

16 

21.5040 

16 

22.1184 

16 
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(a) ONE-LEVEL AVERAGE ERROR RATE MODEL ([2] , [3] ) 



(b) TWO-LEVEL AVERAGE ERROR RATE MODEL {PRESENT ARTICLE) 



(c) SIMULATED ERROR STREAM MODEL (NOT PRACTICAL) 


VITERBI DECODER 

SIMULATED 
ERROR STREAM 

REED-SOLOMON 
DECODER ANALYTICAL 

o nvi u la 1 1 u in w 1 1 n 
GAPPED DATA 


1 IUINO 

(MULTILEVEL MODEL) 
OR SIMULATIONS 


Fig. 1. Various analytical approaches to modeling concatenated 
coding with gapped data. 



CONVOLUTIONAL CODE E b /N 0 , dB 


Fig. 2. Viterbi decoder output error probabilities for ungapped 
system (taken from [5], Fig. 3-1). 
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INSIDE GAPS -oo 

CONCATENATED SYSTEM E b /N 0 , dB 

Fig. 3. Conditional concatenated performance for VLA stand-alone 
system as a function of n Q = number of gapped symbols per 
codeword. 



INSIDE GAPS -4 -3 -2 -1 

CONCATENATED SYSTEM E b /N Q , dB 

Fig. 4. Conditional concatenated code performance for VLA 
arrayed unequally with Goldstone (5-dB gaps) as a function of n Q = 
number of gapped symbols per codeword. 
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CONCATENATED CODE BIT ERROR PROBABILITY 



OUTSIDE GAPS 12 3 4 

INSIDE GAPS -2-1 0 1 


CONCATENATED SYSTEM E b /N Q , dB 


Fig. 5. Conditional concatenated code performance for VLA 
arrayed equally with Goldstone (3-dB gaps) as a function of n Q = 
number of gapped symbols per codeword. 


CONCATENATED CODE BIT ERROR PROBABILITY 



INSIDE GAPS -0.5 0.5 1.5 2.5 

CONCATENATED SYSTEM E^Nq, dB 


Fig. 6. Conditional concatenated code performance for VLA 
arrayed unequally with Goldstone (1.5-dB gaps) as a function of 
n Q = number of gapped symbols per codeword. 
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Fig. 7. Symbol edge effects. 


(RG-K+J)/J + 1-1/J -j 
(RG-K+J)/J 4 

" 'pi DATA RATE R 

Fig. 8. Fluctuations in the number of gapped symbols per gap due 
to worst-case symbol edge effects. 
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NOT INCLUDING 
GAP 


ONE INTERLEAVED BLOCK OF REED-SOLOMON CODEWORDS 

-J 

n SEVERAL VLA GAP CYCLES 


Fig. 9. Incomplete gap cycle effects. 




Fig. 10. Fluctuations in the peak-to-average concentration of gapped symbols per codeword 
due to incomplete gap cycles per interleaved block of codewords. 
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IJ BITS J BITS 



RT BITS 


(b) 


IJ BITS 


J BITS 



RT BITS 


il interleaved code symbols 

-lj LJ VLA GAP CYCLES 


<b) 


IJ BITS J BITS 




Fig. 12. Gap-cycle/ interleaving-cycle resonance effects for 
average effective gap lengths greater than one symbol. 


RT BITS 


(0 



IJ BITS J BITS 




RT BITS 


flSllii SlSSIISSSISSilSi II interleaved code symbols 

“ ■ VLA GAP CYCLES 


Fig. 11. Gap-cycle/interleaving-cycle resonance effects for 
average effective gap lengths up to one symbol. 



| | j | | | j | j | | ; J LOCATION OF MAJOR RESONANCES 

Fig. 13. Fluctuations in the peak-to-average concentration of 
gapped symbols per codeword due to gap-cycle/interleaving-cycle 
resonances. For clarity, figure depicts resonance spacing corre- 
sponding to T/G = 8. Actual T/G = 32.55 for the VLA parameters. 
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CONCATENATED CODE BIT ERROR PROBABILITY 



INSIDE GAPS -°o 


CONCATENATED SYSTEM E b /Ng, dB 


Fig. 14. Unconditional concatenated code performance at 
Voyager-Neptune data rates for VLA stand-alone system. 



INSIDE GAPS -4 -3 -2 -1 

CONCATENATED SYSTEM E^Nq, dB 

Fig. 15. Unconditional concatenated code performance at 
Voyager-Neptune data rates for VLA arrayed unequally with 
Goldstone (5-dB gaps). 
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CONCATENATED SYSTEM E b /N Q , dB 


Fig. 16. Unconditional concatenated code performance at 
Voyager-Neptune data rates for VLA arrayed equally with Goldstone 
(3-dB gaps). 



INSIDE GAPS -0.5 0.5 1.5 2.5 

CONCATENATED SYSTEM E b /N 0 , dB 

Fig. 17. Unconditional concatenated code performance at 
Voyager-Neptune data rates for VLA arrayed unequally with 
Goldstone (1.5-dB gaps). 
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INSIDE GAPS -*> 

CONCATENATED SYSTEM, E b /N Q , dB 

Fig. 18. Comparison of unconditional concatenated code per- 
formance at a Voyager-Neptune data rate and a nearby resonant 
data rate for VLA stand-alone system. 



INSIDE GAPS - 2-10 1 

CONCATENATED SYSTEM E b /N Q , dB 

Fig. 19. Comparison of unconditional concatenated code per- 
formance at a Voyager-Neptune data rate and a nearby resonant 
data rate for VLA arrayed equally with Goldstone (3-dB gaps). 
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Appendix 

Corroboration of the Two-Level Model 


The two-level model for the Viterbi decoder output statis- 
tics is an ad hoc model that was chosen for simplicity. It is 
relatively easy to analyze, and it affords easy separability of 
the analysis into a conditional evaluation of code performance 
and a computation of the gap cycle’s effect on the symbols in 
any given codeword. At the same time, it allows the gapped 
and ungapped portions of the gap cycle to be treated differ- 
ently, not just characterized by overall cycle averages as in the 
even simpler one-level model. 

The validity of the two-level model for the purposes of cal- 
culating concatenated code performance could be completely 
confirmed only by end-to-end tests or simulations of the 
entire concatenated code. Such simulations are impractical 
and were not performed. However, a partial corroboration of 
the model’s accuracy can be obtained by comparing the aver- 
age of the gapped and ungapped Viterbi decoder error rates 
predicted by the two-level models to the overall average 
Viterbi decoder error rates obtained from simulations of the 
Viterbi decoder operating over many VLA gap cycles. If the 
two-level model is accurate, the following equation should 
hold: 


RG-K + 1 
RT 


1 - 


RG-K + 1 
RT 


Pi (A-l) 


Here, p Q and p x are the gapped and ungapped bit error proba- 
bilities defined in Table 1 and Fig. 2 and used in Eqs. (4) and 
(5), and p is the simulated bit error rate (averaged over many 
gap cycles) used in Eq. (1) of this article and plotted in Figs. 3 
and 4 of [2] . 


Figures A-l and A-2 compare the left sides and right sides 
of the purported Eq. (A-l) for the VLA stand-alone case 
and for the VLA arrayed equally with Goldstone. 1 Fairly good 
agreement is obtained. Even the substantial variation of per- 
formance with data rate is somewhat accurately predicted by 
the two-level model. The agreement is best for the VLA 
stand-alone case. The gap conditions in this case are more 
nearly equal to the assumption of zero signal-to-noise ratio 
inside the gaps and infinite signal-to-noise ratio outside the 
gaps, from which the argument for effectively shortening the 
gaps by K - 1 bits was derived. 

A second form of justification for the two-level model is 
obtained by examining its validity at the end points. It was 
already stated in Section IV that the model is exactly correct 
in the extreme limit of zero signal-to-noise ratio inside the gaps 
and infinite signal-to-noise ratio outside the gaps. These ex- 
treme limits represent the maximum possible difference 
between the characteristics of the gapped and ungapped por- 
tions of the gap cycle. The opposite extreme occurs as the 
difference between the gapped zone and the ungapped zone 
goes to zero and the two signal-to-noise ratios become equal. 
The two-level model is obviously exactly correct in this ex- 
treme limit also, because Eqs. (2) through (7) reduce to 
Eq. (1) when p 0 = p x = p and n 0 = it x = 7r. These end-point 
arguments do not directly confirm the validity of the two-level 
model in the intermediate regions, but nonetheless they inspire 
confidence that it should not be too far wrong. 


^or comparison with [2] , VLA gaps are assumed to be 1 msec instead 
of 1.6 msec for the curves in Figs. A-l and A-2. 
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OUTSIDE GAP 0 1 2 3 4 OUTSIDE GAP 0 12 3 4 

INSIDE GAP -°o -oo -oo -oo INSIDE GAP -3 -2-1 0 1 

CONVOLUTIONAL CODE E^/Nq, dB CONVOLUTIONAL CODE E b /Ng, dB 

Fig. A-1. Comparison of Viterbi decoder performance predicted by Fig. A-2. Comparison of Viterbi decoder performance predicted by 

two-level model and by detailed simulation for VLA stand-alone two-level model and by detailed simulation for VLA arrayed equally 

system. (Note: For comparison with [2], VLA gaps are assumed to be with Goldstone (3-dB gaps). (Note: For comparison with [2], VLA 

f msec instead of 1.6 msec for these curves.) gaps are assumed to be 1 msec instead of 1.6 msec for these 

curves.) 
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A new Viterbi decoder ; capable of decoding convolutional codes with constraint 
lengths up to 15, is under development for the DSN . The objective is to complete a 
prototype of this decoder by late 1990 , and demonstrate its performance using the (15, 
1/4) encoder in Galileo. The decoder is expected to provide 1 dB to 2 dB improvement in 
bit SNR , compared to the present (7,1/2) code and existing Maximum-Likelihood Con- 
volutional Decoder (MCD). The new decoder will be fully programmable for any code up 
to constraint length 15, and code rate 1/2 to 1/6. This article describes the decoder 
architecture and top-level design. 


I. Introduction 

The DSN uses concatentated codes to reduce the Bit Error 
Rate (BER) on the telemetry channel from deep space probes 
to the DSN complexes. Standard coding, as used for the Voy- 
ager mission, consists of an outer (255,223) Reed Solomon 
(RS) code and an inner convolutional code with constraint 
length K = 7, and code rate 1/2. Decoding is accomplished by 
a Maximum-Likelihood Convolutional Decoder (MCD), fol- 
lowed by an RS decoder. A typical telemetry chain is shown in 
Fig. 1 . Performance of this coding scheme is well understood 
[ 1 ].[ 2 ]. 

Recently [3] , new convolutional codes have been dis- 
covered that provide a “2-dB coding gain” over existing codes. 
The highest gain, 2.1 1 dB, is achieved by using a (15,1/6) con- 
volutional code, concatenated with a (1023,959) RS code. 
Using (15,1/6) convolutional codes with a (255,223) RS code 
results in an estimated coding gain of 1.8 dB. This gain can be 
realized by building a new Viterbi decoder for the inner code, 
and using the existing RS decoder. Hence, employing the 
newly discovered convolutional codes can result in relatively 
inexpensive improvement in DSN telemetry performance. 


To demonstrate the new codes, an encoder for a (15,1/4) 
convolutional code is being added to the Galileo spacecraft. 
A rate 1/4 code is used instead of a rate 1/6 code, because of 
the limited bandwidth available on the Galileo modulator. This 
encoder, shown in Fig. 2, requires only a small number of 
parts (20 integrated circuits and 60 discrete components) and 
thus has a minimal impact on spacecraft complexity. A proto- 
type decoder is being developed, capable of decoding Galileo 
data, but also of accepting other codes such as DSN standard 
codes and (15,1/6) convolutional codes. Figure 3 shows the 
BER versus bit SNR, for various coding schemes, with a 
predicted coding gain for the Galileo experiment of 1 .5 dB. 

The complexity of a Viterbi decoder depends on three key 
parameters: constraint length (i.e., degree of the generating 
polynomials), code rate (i.e., reciprocal of the number of en- 
coded symbols transmitted for each information bit), and infor- 
mation data rate. The major complexity driver is constraint 
length, since the amount of hardware is roughly proportional 
to the number of states, which is where K is the con- 

straint length. Hence a decoder for K - 1 5 is approximately 
256 times more complex than a decoder for K- 7. Such a 
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complex decoder can be built with current VLSI technology 
within reasonable size limitations. 

This paper describes the prototype decoder. Section 2 out- 
lines system requirements, and Section 3 describes the top- 
level design. Section 4 describes in detail the architecture 
of the processor assembly, the unit performing the actual 
decoding. 


II. Decoder Requirements 

Requirements for the decoder can be separated into three 
categories: 

(1) Performance. The decoder will process convolutionally 
coded data with constraint length up to 1 5 (program- 
mable) and code rate 1/2 to 1/6 (programmable). Data 
rate must meet Galileo requirement (134.4 Kbit/sec), 
with a goal of 1 Mbit/sec. The decoder will utilize a 
synchronization pattern, if it is present in the uncoded 
data stream, to support node synch. In addition, an 
external node synch input will be available. 

(2) Interfaces. The decoder will provide DSN interfaces, 
for testing in CTA 21 and for integration into DSN 
complexes. At a minimum these include symbol input 
from the Symbol Synchronizer Assembly (SSA) or the 
Base-Band Assembly (BBA), decoded information bits 
to the Frame Synchronization Subsystem (FSS), and 
interfaces to station monitor and control. 

(3) Testability. The decoder will include testing capability 
for both stand-alone tests and DSN compatibility tests. 
In the stand-alone test, the decoder will generate a 
pre-programmed information bit sequence, encode it 
according to the desired convolutional code, add a pro- 
grammable amount of white Gaussian noise to the 
symbols, pass the noisy symbols to the decoder proper 
(processor assembly), compare the decoded bits to 
the original sequence, and compute BER, in real time. 
The decoder will also provide GO or NO GO indication 
to the operator. For DSN compatibility testing the de- 
coder will receive a symbol stream, and an un-encoded 
bit stream, decode the symbols, and compute BER. 

Additional requirements concerning operating environment, 
size, power consumption, reliability, fault testing, and main- 
tainability exist, but are not discussed here. 1 


! J. Statman, “Draft Task Plan for Large Constraint Length VLSI Viterbi 
Decoder,*’ JPL IOM 331-87.5-241 (internal document), December 28, 
1987. 


III. Top-level Design 

A functional block diagram of the decoder is shown in 
Fig. 4. The following is an overview of these blocks: 

(1) Processor Assembly. This is the “heart” of the decoder. 
It consists of 256 identical VLSI chips that perform 
the maximum-likelihood decoding of the incoming 
symbol sequence. In addition, this assembly includes 
path memory, metric normalization circuitry, and the 
applicable computer, timing, and control interfaces. 

(2) Simulator Assembly. The simulator assembly generates 
a noisy symbol sequence in three steps. First, an infor- 
mation sequence is generated. Next, this sequence is 
encoded using the appropriate convolutional encoder. 
Finally, a measured amount of noise is added. In addi- 
tion, the simulator assembly sends the uncoded infor- 
mation sequence to the comparator assembly, to 
enable performance evaluation. 

(3) Comparator Assembly. This assembly receives “true” 
information bits from either the simulator assembly 
or from an external input, and decoded bits from the 
processor assembly. It aligns the sequences and collects 
BER data. 

(4) Node Synch Assembly. The node synch assembly 
derives node synch either from the rate of metric 
increase, from an embedded synch pattern, or from an 
external source. 

(5) Erasure Signal Generator. This is an option under con- 
sideration. It is based on an algorithm [4] that com- 
pares the incoming symbols to an encoded version of 
the decoded information bits, to determine probable 
burst-error locations. 

(6) SSA Interface. This module converts the signal coming 
from the SSA to signals compatible with the decoder. 
The two key operations are adjustment of voltage 
levels and removal of additional sign inversions added 
by some encoders. 

(7) FSS Interface. This module sends the decoded bits to 
the FSS, similar to the existing MCD output. 

(8) Other DSN Interfaces. More DSN interfaces are under 
definition. Options are interface to the Telemetry 
Processor Assembly (TPA) and interfaces to future 
DSN data network via Small Computer Standard Inter- 
face (SCSI) bus for data transfer, and General Purpose 
Instrumentation bus (GPIB) for monitor and control. 
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(9) Computer-Controller-Timing. The computer-controller- 
timing coordinates the modules described above by 
providing command, control, and monitor operations 
during initialization and decoding. In addition, it gen- 
erates all the required timing signals and allows for 
extensive stand-alone testing. 

The prototype decoder packaging approach is to provide 
for easy transfer from prototype packaging to a DSN-ready 
system. Standard DSN packaging techniques are used where 
possible. The baseline package is in two drawers, mountable 
in a 19-inch rack. The first drawer is based on a MULTIBUS I 
card cage and includes all the assemblies and external inter- 
faces, except for the processor assembly. The second drawer 
includes the processor assembly. 


IV. Processor Assembly Architecture 

The architecture presented here is for a particular imple- 
mentation of the Viterbi decoder. We start by reviewing sev- 
eral basic definitions and algorithms that are used elsewhere in 
this article. It is not intended as a Viterbi decoder tutorial, and 
the interested reader may read references [1], [2], [5], [6] 
for further information. 

The Viterbi decoder tries to find the best possible match 
between a stream of received symbols and a path through a 
state trellis. The processing is sequential, i.e., using the set of 
symbols corresponding to a single information bit, the decoder 
progresses from one time-slice through the trellis to the next, 
while updating its decision on the most likely path and the 
resulting decoded bits. For a code with constraint length AT, 
the number of states is 2 A: “ 1 , so in the K = 15 decoder there 
are 16384 states. 

We assume here that the code rate is 1 In. This implies that 
each state is connected to two preceding states and to two suc- 
ceeding states, depending on whether the preceding and suc- 
ceeding information bits are 0 or 1 . In fact, it is convenient to 
organize the states in butterflies (so called because the graph 
of associated arithmetic resembles a butterfly). Each butterfly 
contains two states, and has inputs from two other butterflies 
and outputs to two other butterflies. For K = 15, the 16384 
states are organized in 8192 butterflies. 

The data exchanged between the butterflies are accumu- 
lated metrics. These metrics represent the probability of trellis 
paths, i.e., the lower the accumulated metric, ihe more likely 
is the path. There is one accumulated metric per state, or two 
per butterfly. Accumulated metrics are computed inside the 
butterfly. For each set of symbols corresponding to an infor- 
mation bit, the butterflies add the existing accumulated met- 


ric to the metric associated with the new symbols (so called 
“branch metric”), resulting in new accumulated metrics. As 
time passes, accumulated metrics grow, so periodically they 
are reduced down, or normalized. 

A. Basic Trade-Offs 

Several implementation choices were made and are docu- 
mented below. First, the 8192 butterflies can be implemented 
using serial or parallel architectures, or with a hybrid serial- 
parallel approach. In a serial architecture, a single physical 
butterfly processor performs all 8192 butterflies, sequentially. 
In a parallel architecture, 8192 physical butterflies are used. 
In a hybrid approach, n physical butterflies are used, each 
sequencing through 8192 In butterflies. The fully parallel 
architecture was chosen. 

Next, a choice of arithmetic method is made. The arith 
metic operations include addition, subtraction, and compari- 
sons between metrics. The decoder uses integer arithmetic and 
performs bit-serial arithmetic , or bit-by-bit operations. In this 
approach, the metrics (represented by 8- to 18-bit numbers) 
are sent serially, on a single wire, LSB to MSB. A separate 
TDA Progress Report article is under preparation, describing 
the bit-serial versus parallel arithmetic trade-offs. 

Next, the method for decoder graph partitioning is selected. 
Butterfly interconnection can be represented by a graph with 
8192 nodes, where each node corresponds to a butterfly. 
Each node has inputs from two other nodes and outputs to 
two other nodes. The partitioning selected will be described in 
detail in a future progress report. It is a two-level partitioning 
of the graph, where the first-level subgraphs correspond to 
printed circuit boards, while secondlevel subgraphs correspond 
to VLSI chips. Key features of the partitioning are (a) the 
graph is split among 16 identical boards, each with 16 identical 
VLSI chips, leading to easy implementation, (b) any Viterbi 
decoder of constraint length K can be built by wiring together 
2 (*“ 7 ) 0 f these chips or of these boards, and (c) the 

number of wires between boards and chips is relatively small. 

B. Processor Assembly Elements 

The processor assembly, shown in Fig. 5, consists of six 
major functions: 

(1) Symbol Conversion. The symbols arriving into the 
processor assembly are 8-bit 2’s complement quanti- 
ties, arriving at the rate of one symbol per symbol 
clock. The symbol conversion module buffers the sym- 
bols into blocks that correspond to information bits 
(using the node synch signal), converts the symbols 
into sign-magnitude values, and rearranges the sym- 
bols for bit-serial transmission to the butterflies. It also 
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computes the sum of the magnitudes of the six sym- 
bols and transmits it to butterflies, bit-serially, LSB 
first. 

(2) Butterflies. The butterflies are the core of the decoder. 
As shown in Fig. 6, each butterfly consists of two main 
blocks: an Add-Compare-Select (ACS) unit and a Met- 
ric Computer. The ACS uses four adders to add branch 
metrics to accumulated metrics, then compares the 
sums to select two of them for further transmission. 
The metric computer uses a set of adders to compute a 
weighted sum of the received symbols. Both the ACS 
and the metric computer are mathematically specified 
below. The complete decoder for K- 15 has 8192 
butterflies, 32 butterflies per VLSI chip. 

(3) Metric Exchange. The metric exchange function is per- 
formed by the interconnections between butterflies. 
Some of the metrics are exchanged inside the VLSI 
chip, while others are sent via wires between chips and 
in a backplane. All transmitted metrics must be kept 
aligned, i.e., the zth bit of transmitted metric is present 
on all metric exchange wires at the same clock period, 
regardless of the form of this connection. 

(4) Traceback Memory. After each butterfly completes the 
ACS operation it sends two bits to the traceback mem- 
ory. These two bits per butterfly (computed once per 
information bit) represent the results of the two ACS 
select operations. The traceback memory can be viewed 
as a matrix where one dimension is the number of 
states, 16384, and the other dimension corresponds to 
time, and has 3*1*K entries. For K =15, the memory 
has at least 16384 *3*7*15 bits, or approximately 
640 Kbytes. 

(5) Traceback Processor. The traceback processor reads 
and writes the traceback memory to produce decoded 
bits [4] ? 

(6) Normalization Processor. The normalization processor 
monitors several accumulated metrics. When any of 
these metrics exceeds a computer-selected threshold, 
a normalization command is issued to the butterflies, 
to be executed during the next information bit time. 


1. Add-Compare-Select. A diagram of an ACS is shown in 
Fig. 7. The accumulated metrics (16-bits) from neighboring 
states /0 and zl, which were previously computed in some 
other ACS unit, are added bit-serially to the branch metrics 
2> 00 , ^oi * ^10’ anc * ^ii> P rovi ^ed by the metric computer unit. 
The operation produces the sums: 


*oo = W IO + 4 00 


s l0 = m n +b io 


01 


m i0 + b 01 


s 


11 


m n +b n 


These sums are shifted into four shift registers and the 
smaller sum of each pair is selected by the comparators, as 
follows: 

if Ooo < s io>’ m i0 = *00 and bit0 = °. 

otherwise m^ Q = s 1Q and bitQ = 1 

if (*0i < *n)’ w /i = *0) andWn = 0, 

otherwise and bit l = 1 

Here, m /0 and are the output accumulated metrics, and 
bit 0 and bit 1 are the results of the decisions, sent to the trace- 
back memory. 

2. Branch Metric Computer. The branch metrics are com- 
puted in the metric computer (Fig. 8) from the six received 
symbols, r 0 . . .r s . The implementation here is slightly differ- 
ent from that found in the literature, resulting in reduction of 
the dynamic range of the branch metrics by a factor of 2 [4] . 
Let r t be represented as sign and magnitude binary numbers, as 
follows: 


r t = ( s r r i 6 ’ r is’ r i4’ r i3 ’ r n’ r n> r io) 


C. Butterfly Mathematical Representation and 
Implementation 

The following paragraphs describe the equations of the ACS 
and the metric computer unit. 


2 F. Pollara and H. Shao, “Memory Management in Traceback Viterbi 
Decoders,” JPL IOM 331-87.2-242 (internal document), February 12, 
1987. 


where s ( are the sign bits. Let (e Q , e t , . . . , e 5 ) be the label 
assigned to the butterfly at initialization. This label is the out- 
put of an encoder making one of the state transitions of the 
butterfly. Because the generator polynomials of the codes con- 
sidered have leading and trailing ones, each butterfly has only 
two possible branch metrics, and they sum to a constant (for a 
fixed set of symbols). Let 

c t = e. © s. i = 0, . . . , 5 
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then 


b oo = Z ? i 

ieA 

where A is the set of all f s such that c A - 1 , and r i are the mag- 
nitudes of the r/s. Also, b 10 =x- b 00 , where x is 

* = Z ? i 

i= 0 

Finally, for codes with a leading and trailing one in the gen- 
erator polynomials, b l{ = b 00 and b 01 = b 10 . This condition is 


met for our codes with K = 15. If K < 15, we have b n = b 10 


V. Conclusions 

A new DSN Viterbi decoder is under development that 
benefits from two recent Advanced Systems developments: 
the successful search for long constraint length codes which 
yield a “2-dB coding gain,” and the VLSI expertise in the 
Communications Systems Research Section. The top-level 
design, mathematical characterization, and functional speci- 
fications have been completed. The decoder is expected to be 
ready for testing using a Galileo encoder by late 1990. 
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Fig. 2. A (15, V«) convolutional encoder for Galileo. 


139 












BIT ERROR RATE, BER 


SIGNAL-TO-NOISE RATIO (PER INFORMATION BIT), dB 


_3 -2 -1 0 1 2 3 4 5 6 7 8 9 10 11 12 



THEORETICAL 

LIMIT 


EXPERIMENTAL 
(15, 1/4) CODE 
FOR GALILEO 


CURRENT 
(7,1/2) CODE 
(VOYAGER, 
GALILEO) 
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Fig. 4. Decoder functional block diagram. 
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Decoding results are described for long decoding runs of Galileo ’s convolutional codes. 
A 1-kbit/ sec hardware Viterbi decoder is used for the ( 15 , 1/4) convolutional code , and 
a software Viterbi decoder is used for the ( 7, 1/2) convolutional code. The output data of 
these long runs are stored in data files using a novel data compression format which can 
reduce file size by a factor of 100 to 1 typically . These datafiles can be used to replicate 
the long , time-consuming runs exactly and are useful to anyone who wants to analyze the 
burst statistics of the Viterbi decoders . The 1-kbit/ sec hardware Viterbi decoder has been 
developed in order to demonstrate the correctness of certain algorithmic concepts for 
decoding Galileo *s experimental (15, 1/4) code, and for long-constraint-length codes in 
general The hardware decoder can be used both to search for good codes and to measure 
accurately the performance of known codes. 


I. Introduction 

Many long decoding runs of 5 to 40 Mbits have been per- 
formed for Galileo’s experimental (15, 1/4) convolutional 
code and Galileo’s standard (7, 1/2) convolutional code. A 
1-kbit/sec hardware Viterbi decoder 1 is used for the (15, 1/4) 
convolutional code, and a software Viterbi decoder on the 
RT0P71 Sun-3/260 computer is used for the (7, 1/2) convo- 
lutional code. The output data from these long runs are stored 
in data files using a novel data compression scheme of retain- 
ing only the decoded ones and storing them in hexadecimal 
form. With this format a typical output compression of 100 to 
1 is achieved. These data files can be used to replicate the 
long, time-consuming runs exactly and are useful to anyone 
who wants to analyze the burst statistics of the Viterbi 
decoders. 


1 C. R. Lahmeyer, “The 1 Kilobit per Second Viterbi Decoder,” Inter- 
office Memorandum 331-88.3-042, Jet Propulsion Laboratory, Pasa- 
dena, California, August 19, 1988. 


A 1 -kbit/sec hardware Viterbi decoder was developed in 
the past few months in order to demonstrate the correctness 
of certain algorithmic concepts in the decoding of long- 
constraint-length convolutional codes. At present this decoder 
is designed to decode any convolutional code of constraint 
length 15 and with code rate \/n as low as 1/6. Most of the 
recent test runs have used the (15, 1/4) convolutional code that 
the Galileo project has selected [1]. It has the following 
generator polynomials: 

46321 = 100 110 011 010 001 
51271 = 101 001 010 111 001 
63667 = 110011 110 110 111 
70535 = 111 000 101 Oil 101 

Since the fast hardware Viterbi decoder is at present con- 
figured to decode convolutional codes of constraint length 
15 only, a software Viterbi decoder was developed in the 
RTOP71 Sun computer to perform long decoding runs for 
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Galileo’s standard (7, 1/2) convolutional code. The standard 
code has the following generator polynomials: 

133 = 1 Oil Oil 
171 = 1 111 001 

The initial motives for performing the long decoding runs 
were to facilitate the study of the proper interleaving depth 
for the Reed-Solomon code used by Galileo, and to develop 
theoretical models (e.g., the geometric burst model [2]) for 
the decoded output of the Viterbi decoder, from which 
concatenated code performance can be accurately estimated 
without directly simulating the entire concatenated system. 
These decoding run outputs represent a data base useful to 
anyone studying the burst nature of the output error patterns 
of the Viterbi decoders. 

II. Description of Test Setup of the 
Hardware Decoder 

While the (7, 1/2) convolutional code is decoded entirely 
in software, the long decoding runs for the (15, 1/4) convolu- 
tional code require the use of the above-mentioned hardware 
decoder in combination with software. The test configuration 
used for the decoding runs is shown in Fig. 1. The hardware 
decoder interfaces with a PC-compatible computer. Software 
in the PC generates the test data and transmits it to the hard- 
ware decoder, where the most computationally intensive part 
of the decoding, called metric computation, is performed. 
The PC then performs the part of the decoding process called 
traceback, in order to complete the decoding. The PC then 
condenses the decoded bits into the compressed form and 
archives the results to hard disk. 

The generation of the source data is performed by a soft- 
ware noise generation routine in the PC. Gaussian noise sym- 
bols are generated in software at a user-selectable noise level. 
The noise symbols are quantized to 8-bit sign-magnitude 
representation. The information content of this data is assumed 
to be all zeros; thus, any nonzero decoded bits represent 
decoding errors. This is the usual convention when running 
the decoder to test code performance, and it is theoretically 
justified because the code is linear and because it has been 
shown that the decoder does not favor zeros in any way. 

III. Data Representation Scheme 

A typical method of representing decoded output is to 
print ASCII l’s and 0’s for all the decoded bits, but such an 
approach would produce a 5-Mbyte DOS file for a 5-Mbit 
decoding run. Therefore a compact representation scheme was 
developed which preserves all of the information about the 
decoder output but reduces file size by a factor of 100 to 1 
typically. This scheme relies on the fact that the information 


content is all 0’s, and thus the vast majority of the decoded 
bits will be 0’s, with only a few l’s representing decoding 
errors. Using a scheme somewhat like spacecraft image com- 
pression, only the “changes,” or in this case the error bursts, 
are printed. These are represented in hexadecimal notation. 

Figure 2 is a sample printout of a decoding run at a 0.45-dB 
signal-to-noise ratio (E b /N 0 ). The first column is a decimal 
representation of the number of bits between the start of this 
burst and the start of the previous one. The first burst started 
at bit number 0 and the second burst starts 381 bits later. 
Following this is a hexadecimal representation of the error 
burst itself. For example, 9100 represents a burst of three 
error bits given as 1001000100000000 in binary. The defini- 
tion of the end of a burst is a string of at least 16 bits which 
are all 0’s. Four 0’s are printed at the end of each burst to act 
as a delimiter between bursts and to signify the 16 zero bits. 
This definition is somewhat loose in that some of the last 
bits of the printed burst can also be 0’s, but no information 
is lost in any case. All decoding errors will ultimately be 
listed once each. The line with -1 signifies the end of the 
decoding run. Thereafter follow some statistics about the 
entire run. Most are self-explanatory, with Pb representing 
the bit error rate and Ps signifying the symbol error rate, 
i.e., the fraction of Reed-Solomon symbols (8 decoded bits) 
that are corrupted. The size of the entire file “bursts.45” is 
46 kbytes. 

Table 1 lists all the files accumulated so far for the Galileo 
code given above. Represented here are several runs of 5 Mbits 
and a few at 20 Mbits or more. The filename indicates the 
noise level used in that run, e.g., “bursts.45” signifies a run 
with E b /N 0 = 0.45 dB. 

Table 2 lists files recently generated by software decoding 
of the NASA standard (7, 1/2) code used by Voyager, Galileo, 
and other missions. In the filename, “7” signifies the con- 
straint length and the next digits give the noise level. For 
example, “nburst7.1 .4” signifies a run with the (7, 1/2) code 
at E b /N 0 = 1 .4 dB. 

IV. Conclusions 

A library of decoded output data from long decoding runs 
of Galileo’s convolutional codes has been started. Some early 
runs in this collection are listed here, and it is anticipated that 
many more runs with different codes and sample sizes will be 
performed in the future. Any of the files referenced in Tables 
1 and 2 can be made available to interested users on request. 
Useful applications of this work have already been obtained in 
the study of how the experimental Galileo convolutional code 
performs when concatenated with the 8-bit (255, 223) Reed- 
Solomon code [3] . 
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Table 1. Decoding runs to date with the (IS, 1/4) 
convolutional code 


Filename 

Total bits decoded 

bursts. 0 

5 Mbits 

bursts. 12 

5 Mbits 

bursts.22 

5 Mbits 

bursts. 3 

600 kbits 

bursts.32 

5 Mbits 

bursts.42 

5 Mbits 

bursts.45 

5 Mbits 

bursts. 5 

22 Mbits 

bursts. 6 

20 Mbits 

bursts. 7 

40 Mbits 


Table 2. Software decoding runs for the (7, 1/2) 
convolutional code 

Filename 

Total bits decoded 

nburst7.1.4 

10 Mbits 

nburst7.1.45 

5 Mbits 

nburst7.1.5 

5 Mbits 

nburst7.1.55 

10 Mbits 

nburst7.1.6 

5 Mbits 

nburst7.1.65 

10 Mbits 

nburst7.1.7 

5 Mbits 

nburst7.1.75 

10 Mbits 

nburst7.1.8 

5 Mbits 

nburst7.1.85 

10 Mbits 

nburst7.1.9 

40 Mbits 

nburst7.2.0 

40 Mbits 





Fig. 1. Configuration for decoding with 
1 -kbit/sec Viterbi decoder. 
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Fig. 2. Sample decoding run output (from file “bursts.45”). 
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The Galileo spacecraft employs concatenated coding schemes with Reed-Solomon 
interleaving depth 2. This article compares the bit error rate (BER) performance of 
Galileo's concatenated codes , assuming different interleaving depths (including infinite 
interleaving depth). It is observed that Galileo's depth 2 interleaving ; when used with the 
experimental (15, 1/4) code, requires about 0.4 dB to 0.5 dB additional signal- to-noise 
ratio to achieve the same BER performance as the concatenated code with ideal inter- 
leaving. When used with the standard (7, 1/2) code, depth 2 interleaving requires about 
0.2 dB more signal-to-noise ratio than ideal interleaving. 


I. Background 

The Galileo spacecraft employs a communication system 
which uses either a (7, 1/2) convolutional code or a (15, 1/4) 
convolutional code as the inner code, and a (255, 223) Reed- 
Solomon code as the outer code. By using soft, maximum- 
likelihood decoding on the received symbols, the convolu- 
tional codes perform well at low signal-to-noise ratios. How- 
ever, maximum-likelihood decoding of convolutional codes 
creates bursty errors. An interleaver is placed between the 
convolutional code and the Reed-Solomon code to randomize 
the bursty errors before they are fed to the Reed-Solomon 
decoder. 


Concerns were initially expressed last summer 1 ’ 2 and re- 
cently repeated 3 [1] about the adequacy of Galileo’s interleav- 
ing depth for the constraint length 15 code, even when it was 


1 S. Dolinar, “Alternative Code Considerations for Galileo,” JPL Inter- 
office Memorandum 331-87.2-308 (Appendix), (internal document), 
Jet Propulsion Laboratory, Pasadena, California, July 1, 1987. 

2 S. Lushbough, “Galileo Convolutional Coder Meeting with Project, 
31 July 1987,” JPL Interoffice Memorandum 3 1 3/5-18 l-SL:jlc, 
(internal document), Jet Propulsion Laboratory, Pasadena, California, 
August 3, 1987. 

3 L. Swanson, “Interleaving Depths for Reed-Solomon Decoders,” 
JPL Interoffice Memorandum 331-88.2-042, (internal document), 
Jet Propulsion Laboratory, Pasadena, California, July 13, 1988. 
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first mistakenly assumed that Galileo’s interleaver was the same 
as Voyager’s. Depth 4 interleaving was selected for Voyager to 
sufficiently randomize the error bursts created by the (7, 1/2) 
convolutional decoder. Performance degradation for the (7, 1/2) 
code with depth 4 interleaving is insignificant (less than 0.1 dB) 
relative to ideal interleaving at bit error rates between 10“ 4 5 and 
10~ 6 . However, the error bursts from the (15, 1/4) decoder are 
about twice as long (on the average) as the bursts from the 
(7, 1/2) decoder, and thus the longer-constraint-length code 
would seem to require about double the interleaving depth. In- 
stead, Galileo’s actual interleaving depth is only half of Voya- 
ger’s, and this can potentially cause significant concatenated 
system performance degradation for both of Galileo’s codes 
relative to theoretical predictions based on ideal interleaving. 

Previous studies of the effects of interleaving depth on con- 
catenated system performance included some test data for the 
(7, 1/2) code 4 but no in-depth analyses that would allow 
extrapolation to the case of the (15, 1/4) code. Direct simula- 
tion tests of concatenated system performance using the 
(15, 1/4) code were completely unfeasible because of the huge 
amount of data that would have to be collected to verify bit 
error rates (BERs) in the 10” 5 to 10 -6 range, and because of 
the slowness of the software Viterbi decoder simulation (about 
30 hours of CPU time on a Sun-3/260 computer per 100,000 
decoded bits for the (15, 1/4) code). 

Recently the completion of C. R. Lahmeyer’s 1 -kbit/sec 
(currently constrained to run at about 0.1 kbit/sec-still a 
hundredfold increase in speed relative to last summer’s soft- 
ware simulation) hardware Viterbi decoder 5 has allowed us to 
do some long decoding runs not previously feasible for the 
(15, 1/4) code. With the advent of this hardware decoder, the 
following research tools have been developed: 

(1) Long decoding runs (several megabits) for the (15, 1/4) 
convolutional code were performed on the hardware 
decoder, and the error bursts are stored in data files 
conforming to the data compression format described 
in recent memos. 6 * 7 These data files can be used to 


4 F. H. J. Taylor, “Project Galileo Orbiter/Deep Space Network Com- 
munications Design Document,*’ Project Document 625-257, Jet Pro- 
pulsion Laboratory, Pasadena, California, pp. 5-12, January 15, 1981. 

5 C. Lahmeyer, “1 Kilobit per Second Viterbi Decoder,” JPL Inter- 
office Memorandum 331-88.3-042, (internal document), Jet Propul- 
sion Laboratory, Pasadena, California, August 19, 1988. 

6 C. Lahmeyer and K. Cheung, “Long Decoding Runs for the (15, 1/4) 
Convolutional Code,” JPL Interoffice Memorandum 331-88.3-040, 
(internal document), Jet Propulsion Laboratory, Pasadena, California, 
July 27, 1988. 

7 K. Cheung, “More Long Decoding Runs of the Convolutional Codes 
(including the (15, 1/4) convolutional code and the (7, 1/2) convolu- 
tional code),” JPL Interoffice Memorandum 331-88.2-048, (internal 
document), Jet Propulsion Laboratory, Pasadena, California, August 1, 
1988. 


replicate the long, time-consuming runs exactly and are 
useful to anyone who wants to analyze the burst 
statistics of the Viterbi decoder. 

(2) Similar long decoding runs were performed for the 
(7, 1/2) convolutional code using the software simu- 
lation, and the error bursts from those runs are also 
saved in the compressed format. 

(3) Simulation software was developed which reads the 
compressed burst data obtained from the long decod- 
ing runs and simulates the operation of the entire 
concatenated coding system with different interleav- 
ing depths. 

II. Performance Results 

Simulated BERs of concatenated coding systems with vari- 
ous interleaving depths are given in Figs. 1 and 2. Figure 1 
shows the performance of the concatenated system using the 
(15, 1/4) convolutional code as the inner code, and Fig. 2 
shows the performance of the concatenated system using the 
(7, 1/2) convolutional code. The BERs of the concatenated 
coding systems with finite interleaving depths are compared to 
the BERs with ideal interleaving (infinite interleaving depth). 
In both figures, the concatenated code BER is shown as a 
function of two bit-energy-to-spectral-noise-density ratios, 
E b IN 0 - 

(1) for the convolutional code alone and 

(2) for the overall concatenated system. 

The difference between the two E b /N Q scales is the overhead 
of 0.58 dB accounting for the redundancy of the outer (255, 
223) Reed-Solomon code. 

The data points plotted in Figs. 1 and 2 were obtained from 
a series of long decoding runs varying in length from 5 million 
to 40 million decoded bits. Smooth curves were fitted through 
the data points corresponding to BERs greater than 10~ 5 . 
The In statistical uncertainty in the data points at BERs lower 
than 10^ 5 is more than 100% for the cases of finite interleav- 
ing depth. The corresponding uncertainty in the data points 
at BERs higher than 10~ 5 ranges from about 5% to about 
100% of the simulated BER. The la uncertainty in the data 
points for ideal interleaving is between 10% and 30% of the 
BER. 

An example illustrates the difficulty of obtaining accurate 
simulated concatenated system performance at low BERs. 
The rightmost data point in Fig. 1 (interleaving depth 2, con- 
volutional code E b /N 0 = 0.7 dB) required about 120 hours 
(5 days) of running time on the hardware decoder to decode 
40 Mbits. The same decoding run would have consumed 
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1.4 years of CPU time on the software decoder. Furthermore, 
the entire 40-Mbit run produced only three observed code- 
word errors, and the la statistical uncertainty in the simulated 
BER is around 100%. 


III. Performance Comparison 

Tables 1 and 2 show the minimum convolutional code 
E b IN Q to achieve Galileo’s concatenated and unconcatenated 
system performance requirements assuming ideal (infinite- 
depth) interleaving and depth 2 interleaving, respectively. 
Table 3 shows the performance degradation caused by depth 2 
interleaving relative to ideal interleaving for Galileo’s two 
alternative convolutional codes. Galileo’s experimental (15, 
1/4) code requires about 0.4 dB to 0.5 dB additional signal-to- 
noise ratio to overcome the insufficiencies of depth 2 inter- 
leaving and achieve concatenated code BERs between 10 -5 and 
10~ 6 . Galileo’s standard (7, 1/2) code is hurt less by depth 2 
interleaving, but still suffers about 0.2 dB degradation. The 
relative performance advantage of the (15, 1/4) code over the 
(7, 1/2) code is reduced by about 0.2 dB to 0.3 dB from the 
amount predicted in earlier studies (e.g., [1]) based on ideal 
interleaving. With depth 2 interleaving, concatenated system 
performance will be improved by only about 1.2 dB when the 
(15, 1/4) code is substituted for the (7, 1/2) code. The corre- 
sponding improvement for an unconcatenated system or for a 
concatenated system with ideal interleaving is between 1.4 dB 
and 1 .5 dB. 


IV. Conclusions and Recommendations 

Galileo is unfortunately stuck with depth 2 interleaving, 
and so the immediate consequence of our simulations is simply 
to quantify the amount of expected degradation for Galileo’s 
concatenated codes. However, future missions should select 
interleaving schemes that produce minimal degradation. The 
required interleaving depth increases roughly in proportion to 


the constraint length of the inner convolutional code. For 
example, interleaving depth 8 appears sufficient to keep the 
degradation under 0.1 dB for the (15, 1/4) code, as does inter- 
leaving depth 4 for the (7, 1/2) code. 

As alternatives to simply increasing the interleaving depth 
of conventional block interleaving schemes, new techniques to 
combat bursty errors, such as convolutional interleaving, heli- 
cal interleaving, and burst forecasting, should also be investi- 
gated. These techniques are superior to conventional block 
interleaving schemes. Also, a new Reed-Solomon decoder 
which can correct both errors and erasures is being developed 
in the Communications Systems Research Section at JPL. It is 
expected that the performance of concatenated systems will 
be substantially improved by the use of error-forecasting 
techniques together with erasure-correcting Reed-Solomon 
decoders. We propose to investigate the possibility of develop- 
ing better interleaving schemes for future deep space missions 
and to analyze the performance of concatenated coding 
systems using these new interleaving schemes. 

Even though the hardware Viterbi decoder has allowed us 
to simulate many million decoded bits at a time, the data is 
still insufficient to accurately simulate concatenated code 
performance at BERs less than about 10~ 5 . The amount of 
data required for an accurate estimate increases in proportion 
to the interleaving depth, and so it is even more difficult to 
simulate directly the performance of deeply interleaved 
schemes than it was for Galileo’s interleaving depth 2. Hence, 
notwithstanding the recent advance in decoding speed, it is 
still important to develop theoretical models for the decoded 
output of the Viterbi decoder, from which concatenated code 
performance can be accurately estimated without directly 
simulating the entire concatenated system. The geometric 
burst model of [2] should be reexamined for applicability tc 
long-constraint-length codes, and new models need to be 
developed for estimating Reed-Solomon code performance 
based on the theoretical model for the Viterbi decoder output. 
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Table 1. Minimum E b /N 0 to achieve concatenated and uncon- 
catenated system performance requirements under ideal inter- 
leaving assumption 



(7, 1/2) code 

(15, 1/4) code 

Difference 

Unconcatenated 

BER = 5 X 1CT 3 

2.02 dB 

0.52 dB 

1.50 dB 

Concatenated 

BER = 1CT 6 

1.79 dB 

0.33 dB 

1.46 dB 

Concatenated 
BER = 10~ s 

1.70 dB 

0.27 dB 

1.43 dB 


Table 2. Minimum 
catenated system 
depth 2 

E b /N Q to achieve concatenated 
performance requirements for 

and uncon- 
interleaving 


(7, 1/2) code 

(15, 1/4) code 

Difference 

Unconcatenated 
BER = 5 X 10~ 3 

2.02 dB 

0.52 dB 

1.50 dB 

Concatenated 
BER = 10~ 6 

2.01 dB 

0.83 dB 

1.18 dB 

Concatenated 
BER = 1(T S 

1.90 dB 

0.69 dB 

1.21 dB 


Table 3. Concatenated system performance degradation for inter- 
leaving depth 2 versus ideal interleaving 


(7, 1/2) code 

(15, 1/4) code Difference 

Concatenated 
BER = 1(T 6 

0.22 dB 

0.50 dB 0.28 dB 

Concatenated 
BER = 1(T S 

0.20 dB 

0.42 dB 0.22 dB 


CONCATENATED CODE BIT ERROR RATE CONCATENATED CODE BIT ERROR RATE 
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Reed-Solomon (RS) codes form an important part of the high-rate downlink telemetry 
system for the Magellan mission , and the RS decoding function for this project will be 
done by the DSN. Although the basic idea behind all Reed-Solomon decoding algorithms 
was developed by Berlekamp in 1968, there are dozens of variants of Berlekamp ’s algo- 
rithm in current use. This paper attempts to restore order by presenting a mathematical 
theory which explains the working of almost all known RS decoding algorithms. The key 
innovation that makes this possible is the unified approach to the solution of the key 
equation , which simultaneously describes the Berlekamp, Berlekamp-Massey , Euclid , and 
continued fractions approaches. Additionally , a detailed analysis is made of what can 
happen to a generic RS decoding algorithm when the number of errors and erasures 
exceeds the code's designed correction capability , and it is shown that while most pub- 
lished algorithms do not detect as many of these error-erasure patterns as possible, by 
making a small change in the algorithms, this problem can be overcome. 


I. Decoding Reed-Solomon Codes 

In this article we will give a general definition of Reed- 
Solomon codes, state the abstract Reed-Solomon decoding 
problem, describe the two main classes of decoding algorithms 
(time- and frequency-domain decoders), and then give three 
theorems. Theorem 1 explains why the RS decoding algo- 
rithms work, and Theorems 2 and 3 delineate exactly what 
happens if the number of errors and erasures exceeds the 
codes’ designed correction capability. The article concludes 
with three appendices, which give the mathematical back- 
ground needed for the proofs of the theorems presented. 


Let F be a field which contains a primitive n th root of 
unity a. (We assume that the characteristic of the field does 
not divide n.) If L and r are fixed integers between 0 and «, 
the set of all codewords (vectors) C = (C 0 , . . . , C n _ x ) over 
F such that 

n - 1 

£ CV' = 0 / = L,L + \ L+r-l (1) 

i=0 

is called a Reed-Solomon code. The parameters n y r, and L are 
called the length , redundancy , and offset of the code. The 
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parameters k ~ n - r is called the code’s dimension. (Com- 
monly L = 0, 1, or (n - r + l)/2.) The polynomial g(x), 
defined by 

L+r-l 

g(x)= y \ ( x_a/ ) 

i=L 

is called the code’s generator polynomial. Note that C is a 
codeword if and only if the generating function for C, viz., 
C(x ) = C Q + C x x + * * * + C n _ 1 x n ^ 1 is a multiple of g(x). If n 
is odd and r is even, and L = (n - r + l)/2, then the roots of 
g(x) come in reciprocal pairs or*), for; (r/2) 

- 1, and#(x) is a “palindrome.” 

The Zras/c metric property of an RS code is that any two 
codewords must differ in at least r + 1 positions. Thus if 
d H (C f C') denotes the Hamming distance between the code- 
words C and C', and C ^ C', then it follows that 

d H (C,C')>r+\ (3) 

The basic combinatorial property of an RS code is that given 
any subset I C (0, 1, . . . , n - 1} of at most k coordinate posi- 
tions, and an arbitrary set {a. : i E 1} of elements from F, 
there exists an RS codeword C such that C i = a j for all 
i E I. (Proofs of these basic properties can be found in [2] , 
Section 7.3.) 

Suppose we transmit a codeword C over a channel, which 
can, on occasion, change any symbol from F into any other, 
and which in addition can “erase” any symbol, i.e., make it 
completely unintelligible. To model erasures, we introduce an 
“erasure symbol” * and add it to F : F = F U {*}. Thus we 
send a codeword, and receive a vector R = (R 0 , R 1 , . . . , R n ^ x ) 
from F n . The RS decoding problem , ideally, would be this: 
given REF”, find the nearest RS codeword. However, that 
proves to be too hard, and we must be satisfied with the solu- 
tion to an easier but closely related problem. 

To state the decoding problem precisely, we must define a 
distance between vectors over F, the RS decoding distance . If 
V = (F 0 , . . . , V n ^i) and V' = (Kq, . . . , V* n ^ x ) are vectors with 
components in F, we define 


^(V.v)-f (4) 

1=0 


where if x and y are elements of F, 

{ 0 if x = y 

1 if x # y and either x or y is * 

2 if x =£ y but neither x nor y is * 

(5) 

One way to think about this_metric is shown in Fig. 1, for 
F = GF(3). The elements of F are the vertices of a graph, with 
every element of F connected to * by an edge. Then ^^(x,^) 
is just the distance between x and y in the graph. Note that if 
V and V' are vectors with components in F (i.e., with no *s) 
we have 

dfcsCV.V') = 2rf H (V f V') (6) 

The decoding problem for an RS code of redundancy r can 
now be stated. Given an _arbitrary vector R = (R 0 , . . . , 
with components from F, find all RS codewords C such that 

d RS (C,R)<r (7) 

First, note that there can be at most one codeword C such that 
Eq. (7) holds. This is because if ^rs(C, R) < r and <iRs(C', R) 
< r, then by the triangle inequality 

<W C > c ') < <W C > R > + c ') < 2r 

which implies by Eq. (5) that d H (C, C') < r, violating Eq. (3), 
unless C = 

We now describe an efficient algorithm, essentially due to 
Elwyn Berlekamp ([1], Chapter 7), for solving the decoding 
problem. 

For a given R, its erasure set I 0 is defined as 

/„ ={/:*,=*} t 0 = \I 0 \ ( 9 ) 

(The notation I S I denotes the number of elements in the set 
S.) The decoder’s first step is to calculate the erasure locator 
polynomial a 0 (x), defined by 

° 0 ( x ) = n (!-«'*) ( io > 

/e / 0 
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(If there are no erasures in R, a 0 (x) is defined to be 1.) If the 
number of erasures f 0 exceeds r, there can be no solutions to 
Eq. (7); in this case the decoder should simply print “too 
many erasures!” and stop. We will assume that t Q < r in the 
rest of the discussion of the decoding algorithm. 


(see Appendix C). The polynomial a x (x) is traditionally called 
the error locator polynomial , and c o(x) is called the error -and- 
erasure evaluator polynomial . Now the decoder multiplies 
a 0 (x) and cr^x), obtaining a polynomial o(x ), called the 
erasure/error locator polynomial. 


Once the erasure locator polynomial is calculated, the de- 
coder replaces the *s in R with symbols from F. Usually these 
symbols are chosen to be Os, but if the decoder has “side infor- 
mation” about the original values of the C { s corresponding to 
the erased indices i E 7 0 , these values can be used. In any case, 
the result is a new vector R' = ( R ^ , . . . , ), defined by 


Once the polynomials a(x) and cj(x) are known, there are 
two essentially different ways to complete the algorithm. 
These are usually called the time-domain approach and the 
frequency-domain approach. 

The time-domain approach can be described by the follow- 
ing pseudocode fragment. 


*; = 


R. if/ £ I n 


if / ei n 


( 11 ) 


where Z ( = 0 is the usual choice. 

Next, the syndrome is computed, i.e., the r values 

n-l 

S f = for/ = L,L + 1, . . . ,L + r - 1 (12) 

1=0 

which are used as coefficients in the syndrome polynomial 


S ^ = S L +S L + l X+ '- + S L +r -l Xr ' 1 03 ) 

If erasures are present the decoder continues by calculating the 
modified syndrome polynomial S 0 (x), defined by 


S 0 (x) = S(x)o Q (x) (mod x r ) (14) 


Now comes the key step. Define the numbers p and v by 


A* = L(r-t 0 )l 2J 
v = r(r + r 0 )/2l- 1 


(15) 


(If x is a real number, lx\ is the greatest integer less than or 
equal to x, and fxl is the least integer greater than or equal to 
x.) It is an easy exercise to show that p + v - r - 1. The de- 
coder now solves the (x r , 5 0 (x), p , v) problem, i.e., it finds the 
unique lowest degree pair of polynomials cjj(x) and u>(x) such 
that deg^) < p, deg(cc) < v , and 


o x (x)S Q (x) = co(x) (mod x r ) 


I* Time domain fragment */ 

{ 


if (o 0 = 0 or deg(co) > t Q + deg(a 1 )) 
decode = FALSE; 
else { 

count = 0; 

for (i = 0 to n - 1) { 

if(a(a"0 = 0 and o (cr^^O) 
count = count + 1 ; 


E t = 


a ( orO 


{ 


else 


E { = 0; 

} 

if (count = deg(a)) 
decode = TRUE; 
else 

decode = FALSE; 

} 


After execution, if “decode” is “TRUE,” C = (C 0 , . . . j), 

where C t = Rj - E t for / = 0, 1 , . . . , n - 1 is the unique code- 
word within RS distance r of R. On the other hand, if “de- 
code” is “FALSE,” the decoder just prints the warning “no 
codeword within RS distance r.” All early RS decoders used 
an algorithm much like this; such an algorithm is described as 
a “hybrid decoder” in Figure 9.2 in Blahut [2] . 

The frequency-domain approach can be described by the 
following pseudocode fragment. (In this listing, d denotes the 
degree of a(x).) 
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I 


/* Frequency Domain Fragment */ 

{ 

if (a 0 = 0) 

decode = FALSE; 
else { 

decode = TRUE; 

for (j = L + r to n + L + d - 1) 


d 

..s 

0 ;t=i 

for(j=n + Lton + L + d- 


s , ■ - h t 


decode = FALSE; 
break; 


1) 


} 

} 

if (decode = TRUE) 
for (i = 0 to n - 1) 


Proof: First, we suppose there is a codeword C = (C 0 , . . . , 
C n _ t ) within RS distance r of R. We will show that conditions 
(A), (B), and (C) are satisfied* To do this, we define / 0 , a 0 (x), 
and R' as in Eq. (11) (the erasure fills Z { can be arbitrary). 
Next, we define the error s.et I x and error locator polynomial 
o x (x) as 



(16) 

o,(x) = n o -<***) 

(17) 

and the error- and- erasure pattern as E = (E 0 , . . 
where 

• > 1)> 

E. - - C. for / = 0, 1 

(18) 

Finally we define the error-and -erasure set I and the error-and - 
erasure locator polynomial o(x ) by 


(19) 




/= 0 


The decoder now finishes exactly as the time-domain de- 
coder did. The “frequency-domain decoders” described in 
[2, Figure 9.2] and the decoder described in [5] follow this 
general description. (The “time-domain decoder” described in 
Figure 9.7 in [2] is a rare example of an RS decoding algo- 
rithm which is apparently not closely related to the descrip- 
tions in this section. See Whiting [6] for a survey of Reed- 
Solomon decoding algorithms.) 


In each case, the algorithm will locate the codeword within 
RS distance r of R, if there is one, and will print the message 
“no codeword within RS distance r” if there is not. The fol- 
lowing theorem explains why. 

Theorem 1 . There is a codeword within RS distance r of R if 
and only if the following three conditions are satisfied: 


a(*) = n(l-“'*) (2°) 

It follows from Eqs. (12), (18), and (1) that the syndromes 
Sj satisfy 


n - 1 

S. = ^ E. a l i for/ = L, . . . , L + r - 1 (21) 

z-0 


which implies that 

v -zV)'-'' 

1=0 


for/ = 0 r - 1 


(22) 


Thus S L , S i+1 , . . . , S L+ r _ j are the first r components of the 
DFT V = (V 0 , . . . , V n _ 1 ) of the “twisted error pattern” V = 
(K 0> F, ^-i), defined by 


V. = E. <x iL for / = 0, . . . , n - 1 (23) 


(A) 

deg (w (*))< t Q + deg (Oj(x)) 

It follows then from Eqs. (13) and (22) that if we define V{x) 

(B) 

aj(0)#0 

= V 0 + V 1 x + --- + Kh.jX"-!, then F(x) = S(jc) 

(mod x »•), 

and indeed, if we define V 0 {x) = a 0 (x) V(x) mod 

jc'.that 

(C) 


<£> 

II 

(24) 
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where S Q (x ) is defined in Eq. (14). If we now define the error- 
and-erasure evaluator polynomial as 

wW = ( 25 ) 

ie/ 

where cr*(x) = cr(x)/(l - a*x), (compare this to Eq. (B-5) in 
Appendix B), it follows from Theorem B-6 in Appendix B that 

o{x)V{x ) - co(x)(l -x") (26) 

and so, since a(x) = a 0 (x) a^x) and V Q (x ) = a 0 (x) F(x), 

O) V 0 (x) = w(x) (mod x r ) (27) 

Furthermore, since C is assumed to have RS distance r or less 
from R, it follows that t Q + 2 deg(aj) < /*, which in turn im- 
plies degfaj) < p, and deg(co) < deg (a) = t 0 + deg(aj) < 
where p and y are defined in Eq. (15). Furthermore, o x and co 
are relatively prime, since by Lemma B-2 in Appendix B for 
each i E I x , co(flrO =£ 0. Therefore (a x , w) is the solution to 
the (x r , V 0 (x\ p , y) problem, which by Eq. (24) is the same as 
the (x r , S 0 (x), p, v) problem. Thus the polynomials produced 
by the decoding algorithm must be the error locator poly- 
nomial and the error-and-erasure evaluator polynomial, and 
these polynomials satisfy conditions (A), (B), and (C): Equa- 
tion (25) implies (A); Equation (10) implies (B); Equation 
(20) implies (C). 

To complete the proof, we suppose that* conditions (A), 
(B), and (C) are satisfied. We will show that this implies that 
there is a codeword within RS distance r of R. To do this we 
define a(x) = a Q (x) Oj(x); note that condition (A) says that 
deg(to) < deg(a), and condition (C) says that o(x)|(l -x"). 
Hence by Theorem B-5 in Appendix B, there exists a vector 
V = (V 0 , V x , . . . , V n _ l ) and a support set / for V such that 

a(x) = Xa 7 (x) (28) 

W(x) = Xojy j(x) (29) 

We claim now that the vector C = (C 0 , . . . , C n _ x ), defined by 

C. = R\ - V.*- iL for / = 0, 1 (30) 

is a codeword within RS distance r of R. First we show that 
R) ^ tr. This is because R has t Q erasure symbols, and 
apart from these, differs from C only in those indices i for 
which V { =£ 0, i.e., a.(cr*) = 0. But by condition (C), o x has 
exactly deg(aj) roots in {1, a, ... , a"" 1 }, and deg(a 1 )<M, 
and so 


£? rs ( c , R) = t Q + 2 deg (ffj) < t 0 + t 0 

+ 2 L(/*- ? 0 )/2J (31) 

All that remains is to show that C, as defined in Eq. (30), is a 
codeword. Since o x and cc(x) solve the (x r , £ 0 (x), p, v) prob- 
lem, we know that 

aj(j()S 0 (x) = w(*) (mod x r ) (32) 

But by Eq. (14), S 0 (x) = S(x) o 0 (x) mod x r ; and since o(x) = 
a 0 (x) aj(x), by Eq. (32) we have 

v(x)S(x) = co(x) (modx r ) (33) 

On the other hand, by Eqs. (28) and (29), together with Theo- 
rem B-6 in Appendix B, we have 

cr(x)^(x) = cu(x) (modx r ) (34) 

Now gcd(a 0 (x), x r ) = 1 (see Eq. 10), and condition (B) guar- 
antees that gcd(aj(x), x r ) = 1, and so gcd(a(x),x r ) = 1. Thus 
by Eqs. (33) and (34) we have 

S(x) = V(x) (modx'') (35) 

Equating coefficients of x' for / = 0, 1 , . . . , r - 1 on both sides 
of Eq. (35), we see that 

S j+L = V. for / = 0, 1 , . . . ,r - 1 (36) 

But this implies that 

n-1 n-1 

^ R'i <**' = 2 V i ot ' iLaii for; = L L + r - 1 

/=0 i~0 

(37) 

which says that C, as defined by Eq. (30), is a codeword. ■ 

With the help of Theorem 1 , we can now explain why the 
time-domain and frequency-domain decoders work. First, we 
discuss the time-domain decoder. The first line checks condi- 
tion (A) and (B) of Theorem 1. The “for” loop checks condi- 
tion (C), by evaluating the polynomial o(x) for x = or*, for 
/ = 0, 1 1 . Notice that there is a check for a (or 1 ) = 0; 
this is necessary, since as we will see, it is possible for o(x) to 
have a double root. The formula for E t follows from Eq. (26) 
and Corollary B-7 in Appendix B, and the fact that E t = FJer Ll 
(see Eq. 23). 
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Next, we consider the frequency-domain decoder. The first 
line checks condition (B) of Theorem 3. The first “for” loop 
extends the sequence S L’ S L + 1 > • * • >$L+r-l recursively, using 
o(x ) as the characteristic polynomial, and the second “for” 
loop checks to see whether or not this extension has period n. 
If the sequence is not periodic, then either condition (A) or (C) 
must fail, by the first part of Theorem B-10 in Appendix B. 
On the other hand, if the sequence is periodic, then 


tion (B) anyway since it is so easy to do so; if (B) fails, no fur- 
ther work is necessary. Theorems 2 and 3 show conditions (A) 
and (C) are independent, however, so that they both must be 
checked. 

Theorem 2. For any word R E F n , the polynomials cr^x) 
and co(x) must satisfy the following three conditions: 


d 

£ o k S hk = 0 for all j > L + d 

k-0 

and so if u(x) = + S L+ 1 X + • * • , then cj ( x ) u(x) has de- 

gree < d. But a(x) w(x) = oo(x) (mod x r ), and so deg(co) < 
d , i.e., condition (A) is satisfied. Next, since condition (B) 
insures that gcd(a 1 , co) - 1 , the second part of Theorem B-10 
shows that condition (C) holds. Finally, the formula given 
for the error vector E i follows from the fact that (S L , . . . , 
S L+n _ i ) is the DFT V of the twisted error V vector defined in 
Eq. (23). This follows from the basic Theorem B-6, which says 
that the components of V satisfy the homogeneous difference 
equation whose characteristic polynomial is a(x). 

All published RS decoding algorithms correctly locate the 
codeword within RS distance r of the received word, if there is 
one. However, almost all of these algorithms (including all 
algorithms in [2], [3], [5], and [6]) can fail badly when 
there is no such word. By this we mean that there usually exist 
words R which are not within RS distance r of any codeword, 
and yet which cause the decoder to output a vector C rather 
than to print the message “no RS codeword within RS dis- 
tance r.” One naive way to avoid this problem is simply to test 
any word produced by the decoder to see if it is a codeword, 
and then to see if it is within RS distance r of the received 
word. However, this method is not quite foolproof (division 
by zero is possible in either the time- or frequency-domain 
approaches), and more complex than necessary. 


(D) 

deg(Oj) < ii 

(E) 

deg(w) < v 

(F) 

gcd(Oj,w) = x 1 


where x* is the highest power of x dividing ^(x). 


Proof: Conditions (D) and (E) follow from the definition of 
the (a, b, fx, v) problem. Condition (F) follows from Lemma 
C-3 in Appendix C. ■ 

Theorem 3. Conversely, given a set I Q of t 0 erasure locations 
and any pair of polynomials a x (x) and co(x) satisfying condi- 
tions (D), (E), and (F), there exists a vector R G F n , and a 
choice of “erasure fills” Z / (see Eq. 11) which will produce 
Xaj(x) and Xco(x) as the solution to the (x r , S 0 (x), /r, v) 
problem. Indeed, if Oj(x) and <o(x) are relatively prime, let 
S = [S 0 , S t , . . . , S n _ l ] be any vector such that 

‘ S L + 5 L + l JC + -" + 5 L + r-l X ''' 1 = ^ m0d *'' 


(38) 


where a(x) = a Q (x) ^(x), and if the vector R' = [^q, . . . , 
R , n ^ 1 ] is defined to be the inverse DFT of S, i.e., 


The difficulty is that the decoders typically do not check 
all of the conditions (A), (B) and (C) of Theorem 1 . The next 
two theorems explain why it is essential to make this check. 
Theorem 2 gives conditions on the polynomials and co that 
must always be satisfied, whether R is within RS distance r of 
a codeword, or not. Theorem 3, on the other hand, shows that 
the conditions imposed on o x and oo in Theorem 2 are suffi- 
cient to guarantee the existence of a vector R which will pro- 
duce these polynomials. Together, these two theorems show 
that there are Rs that will produce a x s and cos satisfying some 
but not all of conditions (A), (B), and (C). Actually, condition 
(C) implies condition (B), but it is worthwhile to check condi- 


R\ = -TS. cr" 

1 n J 


/=* o 


fori = 0, 1 


(39) 


and if R is defined by 


*/ = 



if' ? f 0 
if/ e i 0 


(40) 


then if the RS decoding algorithm is applied to R, (using the 
components of R' to fill in the erasures) a t (x) and co(x) will 
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be the error locator and error-and-erasure evaluator polynom- 
ials. Similarly, if gcd(aj , c o) = x* with / > 0, any S satisfying 


S L +S L+ 1 X + 


+ Wi*" M = 


W(£) 

a(x) 


modx 


r-J 


(41a) 




°x,+r-/-r 


Appendix C and the example following it, (a x , go) is the solu- 
tion to the ( x r , ^(x), p, v) problem. Now if the decoding 
algorithm starts with R, and fills in the erasures to produce R', 
by Eq. (39) the syndrome polynomial S (*) will be S L + S L+1 X 
+ ■ ■ ■ + S^+r- 1*'* 1 > which, by Eq. (38), is the same as 
t o(x)/o(x) mod x r . Thus the modified syndrome o Q (x ) S(x ) 
will be S 0 (x) = co(x)/o 1 (jc) mod x r , and, as we have seen, this 
means that the decoding algorithms will produce ^(x) and 
cj(x) as the error evaluator and error-and-erasure evalua- 
tor polynomials. The case when gcd(a 1? go) =£ 1 is handled 
similarly. ■ 


mod x r ~ /+ 1 

o(x) 

(41b) 

again with R ' and R defined as in Eqs. (39) and (40), will do. 

Proof: We distinguish two cases: gcd(a 1? co) = 1 and 
gcd(a x , co) =£ 1. If gcd(aj , go) = 1, and 5 0 (x) is defined to be 
the polynomial c o(x)/o l (x) mod x r , then by Theorem C4 in 


Corollary. If t Q < k f then there is a vector R 0 that produces 
(aj , co) as the error locator polynomial and error-and-erasure 
evaluator with zeros as the erasure fills. 

Proof: Let R be the vector defined by Eq. (40), and let C 
be any RS codeword that agrees with R on the set 7 0 . (There 
will be such a codeword, by the basic combinatorial property 
of RS codes, since k > f Q .) Then the vector R = R 0 - C will 
have the same syndrome as R 0 , and has zeros on the erasure 
set I Q . ■ 
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Appendix A 

The Discrete Fourier Transform 


Let f be a field which contains a primitive nth root of 
unity a. (If the characteristic of F is finite, we assume that it 
does not divide n.) We first note 

n-l 

1 -x n = j - [ (1 -a*x) (A-l) 

i-0 

This is because the polynomials on both sides of Eq. (A-l) 
have degree n , constant term 1 , and roots of*, for / = 0, 1 , . . . , 
n - 1. 

Next, let 

V = (V 0 ,V 1 V n _ x ) (A-2) 

be an ^-dimensional vector over F, and let 

V = V n -i) ( A ’ 3 > 

be its discrete Fourier transform (DFT), defined by 
^ n ~ x 

V. = ^ V.a iJ for / = 0, 1, . . . , n - 1 (A-4) 

/=o 


The components of V can be recovered from those of V via 
the inverse DFT 

1 n ~ l ^ 

V. = — for / = 0, 1 1 (A-5) 

n /' = o 

If we interpret the components of V and V as the coefficients 
of polynomials, i.e., if we define 

V(x) = V Q + V x x + • • • + V„_ 1 x n ~ 1 (A-6) 

and 

F(jc) = V 0 + V x x + • • • + V n _ 1 x n ~ 1 (A-7) 

then the DFT and IDFT relationships, Eqs. (A4) and (A-5) 
become 

V. = V(a 0 (A-8) 

and 

V. = ^ F(0 (A-9) 
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Appendix B 

Some Important Polynomials and the Fundamental Identity 


Throughout this section / will denote a fixed subset of 
{0, 1, . . . , n - 1}. We associate several polynomials with this 
set. For example, the locator polynomial for / is 

Oj (*) = PI (1 -a'x) (B-l) 

«e/ 

The co-locator polynomial is 

’/(*)= n (i -° ,jc) (B - 2) 


Lemma B-2. If i £/, gj v 7 (ar z ) = dr 1 )- In particular, 

cov f/ (a-0 = 0 if and only if K, = 0. 

Proof: This follows from Eq. (B-5) and Lemma B-l . ■ 

Lemma B-3. The polynomials o\{x) are linearly indepen- 
dent, and therefore form a basis for the set of all polynomials 
of degree < |/| . 

Proof: If 

2 X i a l (x) = 0 

ze/ 


In view of Eq. (A-l) we plainly have 

1 -X n = Oj(x) Tj(x) (B-3) 

For each value of i £ / we also define 


on setting x = or*, we would get by Lemma B-l , orj(cr z ) = 0, 
but since (again by Lemma B-l) a 7 (or z ) # 0, this implies that 
= 0. The last statement of the lemma now follows from 
the facts that (a) each oj(x) has degree exactly |/| - 1 and 
(b) there are exactly |/| of them. ■ 


o\{x) 


Oj(x) 

(1 - a' x) 


= n t 1 - a * x ) 

/€/ 


The next lemma deals with the minimal support set /(V) of 
V, which is defined by 

/(V) = {/€/ : K^O} (B-6) 

In what follows, the corresponding polynomials will be de- 
noted by o v (x), 7 v (x), and oo v (x), rather than cr 7 ^(x), etc. 


Finally, let V = (V Qi V x , . . . , V n _ l ) be a vector, such that/ is 
a support set for V, i.e., V { = 0 if i ^ L Then the (V, /) evalua- 
tor polynomial is defined as 


w v>/ oo = 2 ^ °/( x ) ( B ‘ 5 ) 

zG7 


Lemma B-4. 

gcd(a f (x), co v/ (x))= fl (1 - oc‘x) 
/-/(V) 

In particular, gcd(a v (x), ou v (x)) = 1 . 


We will need several lemmas about these polynomials. 
Lemma B-l. For all /,/€/, 


Proof: If o>y 7 (x) had a factor in common with a 7 (x), then 
by Eq. (B-3) coy 7 (a -1 ') = 0 for some i €/. But by Lemma B-2, 
this is true if and only if V ( = 0, i.e., if / G / - /(V). fl 




0 

n.„<> 

k*i 


if / =£ i 
if / = / 


In particular, aj( oc~ l )¥=0. 


We note that for any V and support set /, a 7 (x) divides 
1 - x n and deg cj V 7 (x) < deg a 7 (x). The following theorem 
is a kind of a converse to this. 

Theorem B-5. Suppose a(x) and co(x) are polynomials such 
that <j(x)|1 - x n and deg(co) < deg (a). Then there exists a 
vector V and a support set / for V such that 


Proof: Follows immediately from Eq. (B4). 


a(x) = Xa 7 (x) 


(B-7) 
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GJ(X) = \0J VJ (X) 


(B-8) 


= TiW x )( l - jc ”) 

i*e/ 


for a nonzero constant X. Furthermore, if in addition gcd(a(x), 

oo(x)) = 1 , then in fact there exists a vector V such that = w v,/^) ~ * ) 


o(x) = \Oy(x) 
co(x) = \oj y (x) 


(B-9) The following Corollary to Theorem B-6 tells us how to 
reconstruct the nonzero components of V from a 7 (x) and 
B-10) Wy ( /(*). It involves the formal derivative o’j(x) of the poly- 

nomial a 7 (x). 


Proof: Suppose a(x) 1 1 -x n . Then Eq. (B-7) must hold for 
some subset /of {0,l,...,/?-l} and some nonzero constant 
X. Since deg(co) < deg (cr) = deg(a 7 ), and since the polynomials 
o'j{x) are linearly independent by Lemma B-3, 


w(*) = X 22 u i a j( x ) (B-ll) 

IGI 

for certain constants u t . Thus if we define 


Corollary B-7. If / is a support set for V, then for each i G I, 
we have 


V t = -a 


,' W V, 






(B-15) 


Proof: If we differentiate the fundamental identity in Eq. 
(B-13) we obtain 


1 u. if ; e 7 

(B-12) 

0 if/^7 

Eq. (B-10) follows on comparing Eq. (B-ll) to Eq. (B-5). 
Finally, by Lemma B-4, gcd(a 7 (x), u V / (r)) = 1 if and only if 
I = 7(V), and so if gcd (a(x), co(jc)) = 1, Eqs. (B-7) and (B-8) 
become Eqs. (B-9) and (B-10). ■ 

The next theorem is the most important result in this 
section. 


o 1 (x)V'(x) + a' I (x)V(x) = <ji VI (x)(-nx n ~ l ) 

+ Wy ;(x)(l - X n ) 

(B-16) 

Note that if ,v = a~‘ with i G I, from Eqs. (B-3) and (A-l) we 
see that both o f (x) and 1 - x n vanish. Thus if x = or*, Eq. 
(B-16) becomes 


Theorem B-6. If I is a support set for V, then the poly- 
nomials V(x), o T (x), and gj v 7 (x) satisfy 

Oj(x) V(x) = a/y^Ml - x n ) (B-13) 

Proof: Using the definitions in Eqs. (A-4) and (A-7), 
together with the fact that / is a support set for V, we find 
that 

1 

V00 = £Va" (B-14) 

/£/ /=0 

According to Eq. (B-4), Oj(x) = o‘(x) (1 - a' x) for all / G /, 
and so from Eq. (B-14) we have 


Oj(x) V(x) = ^ V.o'jix) (1 - a' x) ^ x> a‘> 
iei /= o 


ct' ( a* 1 ') V(a‘) = -«a'co v / (a - ') (B-17) 

But from Eq. (A-9), V(ol~*) = nV r This fact, combined with 
Eq. (B-17), completes the proof. ■ 

Corollary B-8. gcd(K(x), 1 -x n ) = r v (x). 

Proof: From Eq. (B-3) with I - /(V), we have 1 - x n = 
o v (x) r v (x). Then, if we divide both sides of Eq. (B-13) by 
a v (x), we get ^(x) = co v (x) 7y(x). Since by Lemma B-4, 
gcd(ay(x), cuy(x)) = 1 , the Corollary follows. ■ 

Now we can prove a kind of converse to Theorem B-6. 

Theorem B-9. Suppose that the vector V is given, and that 
for certain polynomials a(x) and cj(x) we have 

a(Jc)V(x) = oj(x)(1 -x n ) (B-18) 
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Then there exists a polynomial A(x) such that 


and co(x) = cj 0 + • * • + co d _ l x d ~ 1 , then it follows from Eqs. 
(B-26) and (B-27) that (i Uj ) satisfies an HDE with characteristic 
a(x) = A(x) ffy (x) (B-19) polynomial a(x) if and only if 


«W = X(x)co y (x) (B-20) 


cr(x)w(x) = co(x) (B-28) 


Proof: By Eq. (B-18) we have 

a(x) V(x) = 0 (mod 1 -x") (B-21) 

This implies that 

a(x) = 0 (mod 1 ( (B-22) 

\ gcd(l -x n ,f(*)) I 

But by Corollary B-8, gcd(l - x", F(x)) = r v (x), and from 
Eq. (B-3), 


where deg(to(x)) < deg(a(x)). In particular, the sequence 
(iij) is periodic of period «, i.e., Uj = u^ n for / >n, if and only 
if there is a polynomial £2(x) of degree < n such that 

(1 -x")u(x) = £2(x) (B-29) 

where deg ft < n. The following theorem is needed in the dis- 
cussion of the frequency -domain decoder. It assumes that (u/) 
is a sequence that satisfies a dth-order HDE with characteristic 
polynomial o(x), as described by Eq. (B-28). 


(1 - *") 

TyO) 


OyC*) 


and so Eq. (B-22) is equivalent to Eq. (B-19), for a suitable 
polynomial A(x). Then Eq. (B-18) becomes 


Theorem B-10. If a(x) divides 1 -x n , then the sequence 
(uj) has period n . Conversely, if (w ; ) has period n and if a(x) = 
a 0 (x) o 1 (x) # where a 0 (x) divides 1 - x n and gcd(aj(x), 
co(x)) = 1 , then a(x) divides 1 - x n . In particular, if gcd(o(x), 
1 — x n ) = 1 , then u(x) 1 1 -x”. 


X (x) a y (x) K(x) = cu(x) (1 - x”) (B-23) 

but multiplying Eq. (B-13) by A(x) we obtain 

A(x) a y (x) F(x) = X(x) co y (x) (1 - x") (B-24) 

Comparing Eq. (B-23) to Eq. (B-24), we see that 

co (x) = X(x) Wy (x) (B-25) 

as asserted. This completes the proof of Theorem B-9. ■ 

The next results in this section deal with homogeneous dif- 
ference equations (HDEs). We say that the infinite sequence 
u Q , « 1} . . . satisfies a dth-order HDE if there exist constants 
o 0 , ,o d , with o Q =£ 0 and o d ^ 0 such that 

d 

2 a k u /.fc = 0 {oi/ >d (B-26) 

= 0 

The polynomial a(x) = a 0 + • • • + o d x d is called the charac- 
teristic polynomial of the HDE, and the degree d of u(x) is 
called its order. If we define 


Proof: If a(x) divides 1 - x", then 1 - x n = o(x) r(x) for 
some polynomial r(x). If we multiply both sides of Eq. (B-28) 
by r(x), we obtain (1 -x") m(x) = co(x) r(x). But deg(oc(x) • 
r(x)) < deg(a(x) r(x)) = «, and so by Eq. (B-29) (w ; ) has 
period n . 

Conversely, if Eq. (B-29) holds, and we multiply Eq. (B-28) 
by 1 - x n and Eq. (B-29) by a(x), then we find that ft(x) 
• a(x) = co(x) (1 - x”). Therefore a(x) |cj(x) (1 - x"). 
Since a(x) = a Q (x) a x (x) and a 0 (x)|l - x n , it follows that 
a 1 (x)|co(x)(l - x n )j a Q (x). But gcd(o x (x), co(x)) = 1, and 
this means that a 1 (x)|(l -x")/o 0 (x), which implies that 
a(x) = a 0 (x)a 1 (x)|l -x w . ■ 

Having briefly discussed homogeneous difference equations, 
we are now in a position to discuss circular homogeneous dif- 
ference equations (CHDEs). We say that the finite sequence 
(w 0 , . . . , u n _ x ) satisfies a dth-order CHDE if there are con- 
stants a 0 , a l , . . . , o d with a 0 ^ 0 and o d ^ 0, such that 

d 

2 ° k u j-k = 0 for / = 0 1 (B-30) 

k = 0 


o k u._ k for; = 0 d- 1 (B-27) 

k = 0 


where the subscripts must be inieipieted mod n. The poly- 
nomial a(x) = o Q + c J jX + * • * + o d x d is called the characteristic 
polynomial of the CHDE, and d is its order. If we define 
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(B-33) 


w(x) = u Q + u x x + • * • + u n _ x x n , then Eq. (B-30) holds if and 
only if 



1 -x” 

gcd OO), 1 -x”) 


a(x)w(x) = 0 (mod 1 -x”) (B-31) 

which is a divisor of 1 - x n . Thus Theorem B-6 says that V sat- 
Equivalently , (w 0 , . . . , u n ~\) satisfies a CHDE if and only if isfles a CHDE of order |/|, where / is any support set for V. 
there is a polynomial w(x) such that Conversely, Theorem B-8 says that V does not satisfy a CHDE 

of order lower than |/(V)|. But we know from Eq. (B-6) that 
a(x)w(x) = a?(x) (1 -x") (B-32) |/(V)| = weight (V)and so we have proved Theorem B-ll. 


Plainly, the a(x) of smallest degree such that Eq. (B-32) holds Theorem B-ll. The weight of V is the degree of the least- 

is order CHDE satisfied by V. 
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Appendix C 

The (a(x), b(x), ix, v) Problem 


Given polynomials a(x ), b(x ), with deg(Z>) < deg (a) = m, 
and nonnegative integers ju, v with \x + v = m - 1, consider the 
set 5 = S(a, 6, /i, y) of all pairs of polynomials (a(x), go(x)) 
such that 

deg(a) < \x deg(co) < v (C-l) 

a(x)Z>(*) = go(x) (mod a(x)) (C-2) 

Theorem C-l. If [x + v = m - 1, the set S(a f b , /i, v) is not 
empty. Indeed, there exists a pair ( o Q , go q ) G S(a, b, jx, v ) such 
that every pair (a(x), co(x)) ES(a, b,fx, v) is of the form 

fW = (c-3) 

u>(x) = k(x)u 0 (x) (C-4) 

Furthermore, (a 0 , go 0 ) is unique up to multiplication by sca- 
lars. We summarize this by saying that (a 0 , co 0 ) “solves the 
(a(x), b(x), fx , p) problem.” 

Proof: A proof is given in [3] , Theorem 8.5, where it is also 
pointed out that Euclid’s algorithm can be used to find (a 0 , 
go 0 ). Specifically, if one applies Euclid’s algorithm as described 
there to the pair (a(x), Z>(x)), and stops when the degree of 
the remainder r y (x) becomes < v for the first time, then (r ; (x), 
r ; (jc)) is the solution to the (a(x), Z>(x), jx, v) problem. ■ 

Lemma C-2. (a, go) G S(a , Z>, ju, v) solves the (, a , Z>, /i, v) 
problem if and only if there exists a polynomial r(x) such that 


kco 0 = ko Q b + ra (C-7) 

But since (a 0 , co 0 ) G S(a , b, ix, v), we know that co Q = o Q b 
+ r Q a for some polynomial r Q (x), and so we have 

ku> Q = ko Q b +kT Q a (C-8) 

Comparing Eqs. (C-7) and (C-8), we see that r = kr 0 , and so 
/:|gcd(co 1 , o t , r x ). This implies by Eq. (C-6) that k is a scalar 
and so (a, go) is a scalar multiple of (a 0 , go q ), i.e., (a l5 coj) 
solves the ( a , b , p, v) problem. ■ 

Lemma C-3. If (a Q , co Q ) solves the (a, Z>, /i, i>) problem, then 

gcd (a Q , oj 0 ) = gcd («j 0 , a) (C-9) 

Proof: By Lemma C-2 we know that 

w 0 = a o b + V (C-10) 

with 

gcd(w 0 ,CT 0 ,T 0 ) = 1 (C-l 1) 

Now by Eq. (C-10) any common divisor of a 0 and a must 
divide a> 0 , i.e., gcd(a 0 , a)|gcd(a 0 , w 0 ). On the other hand, 
Eq. (C-10) also says that any common divisor of ct 0 and to 0 
must divide T 0 a and so by Eq. (C-l 1 ) must divide a. Thus 
gcd(a 0 , w o )|gcd(cr o , 0 ). ■ 


cc = ob +ra (C-5) 

where 

gcd (go, a, t) = 1 (C-6) 

Proof: Suppose that (a, co) solves the problem. Then by Eq. 
(C-2), there exists a polynomial r(x) such that Eq. (C-5) holds. 
If co(x), a(x), and r(x) had a common factor d(x), then with 
co' = co/d , o' = o/d , and r = r/d , Eq. (C-5) imphes go' = ob 
+ r a, which means ob = go' (mod a) is a smaller degree solu- 
tion to Eqs. (C-l) and (C-2), contradicting the minimality of 
(o, 70- 


Theorem C-4. Conversely, given polynomials a(x), a Q (x), 
and co 0 (x) such that Eqs. (C-l) and (C-9) hold, there exists a 
polynomial b{x) of degree < m - 1 such that (a 0 , a> 0 ) solves 
the (a(x),b(x),n,v) problem. 

Proof: Let d(x) = gcd(a 0 (x), w 0 (x)) = gcd(a 0 (x), a(x)), 
and Oj = ajd, coj = io 0 /d, a x = a/d. Since gcd(Oj , a, ) = 1 we 
can define 


GO 

b' = —mod a, (C-12) 

o, 1 


Conversely, suppose Eqs. (C-5) and (C-6) hold, and that 
(o 0 , co 0 ) solves the (a, b, ju, v) problem. Then by Theorem C-l, 
go = kco 0 , a = ko Q , and Eq. (C-5) becomes 


It follows that 


o^’ - go x (mod^) 


(C-l 3) 
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i.e., 

w, = (C-14) 

for a suitable polynomial (x). We note that 

gcd( o v t x ) = 1 (C-15) 

since a common factor of o x and t 1 would by Eq. (C-14) also 
divide cOj ; but gcd(aj , cc^) = 1. We now distinguish two cases, 
according to whether d and t 1 have a factor in common or 
not. 

Case 1: gcd (<i, r x ) = 1. In this case if we multiply Eq. 
(C-14) by d we obtain 

w o = % b ' +T i a (C-16) 

with gcd(o Q , oj 0 , r J ) = gcd W, r, ) = 1, and so by Lemma C-2, 
(a 0 , co 0 ) solves the ( a , b', p, problem, where b ' is defined 
by Eq. (C-12). 

Case 2: gcd(<i, Tj) 1. In this case, let \(x) be the product 
of all the irreducible polynomials which divide p but do not 
divide r x , i.e., 

X = P| {p : p is irreducible, p I d f p X Tj} (C-17) 
Next, we define 

b = b f + Xa x 

(C-18) 

r o = r i - Xff x 

Then from Eq. (C-14) it follows that 

co 1 =a 1 h + r 0 « 1 (C-19) 

If p is an irreducible divisor of d , then p cannot divide r 0 , be- 
cause if p | Tj then p X o x by Eq. (C-15) and p l X by Eq. 
(C-17), so that p K r 0 = t 1 - Xo x . On the other hand, if p X t x , 


shows that gcd (d, r Q ) = 1. Thus if we multiply Eq. (C-19) by 
d we obtain 

= °o b + r o a ( C ‘ 20 ) 

with gcd(co 0 , a 0 , r 0 ) = gcd(d, r 0 ) = 1, and so by Lemma C-2, 
(a 0 , co 0 ) solves the (a, b, p, i>) problem, with b defined in Eq. 
(C-18). ■ 

Example: Suppose a(x) = x m . Then the construction of 
Theorem C-4 simplifies considerably. Indeed, suppose we are 
given polynomials o 0 (x) and cj 0 (x) such that Eqs. (C-l) and 
(C-9) hold , with a (x) = x m . Then gcd (a 0 , co Q ) = gcd (a 0 , x m ) = 
x* for some value of /. If b{x) is such that Eq. (C-2) holds, 
then dividing by we obtain 

o x b = u> x (modx m ~i) (C-21) 

where gcd (o 1 ,x m ~ ; ) = 1. Thus if (a 0 , co 0 ) is to solve the 
(x m , b, p, a) problem, then it must be true that 

b ( x ) = ~ (mod x m ~’) (C-22) 

If /' = 0, i.e., if gcd(a 0 , co 0 ) = 1 , then Eq. (C-22) is both neces- 
sary and sufficient. On the other hand, if/' > 1 , it is easy to see 
that Lemma C-2 implies that (a 0 , to 0 ) solves the (x m , b, q. v) 
problem if and only if Eq. (C-22) holds and if in addition 

b(x) ? p. (mod x m ~ ,+ 1 ) (C-23) 

Thus if we expand co l /o l as a power series, viz. 

= a 0 +a i x + ‘ " ( C ' 24 > 

then any b(x) = b 0 + b x x + * * * + w *ll do, provided 

that 

b k - a k for k = 0, . . . , m - j - 1 


(C-25) 
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Performance of Efficient Q-Switched Diode-Laser-Pumped 
Nd:YAG and Ho:YLF Lasers for Space Applications 

W. K. Marshall, K. Cowles, and H. Hemmati 
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Solid-state lasers pumped by continuous-wave diode lasers can be Q-switched to obtain 
high-peak-power output pulses. In this article, the dependence of laser-pulse energy , aver- 
age output power , peak power, and pulse width on pulse-repetition frequency in Q-switched 
Nd:YAG and Ho: YLF lasers is determined and compared. At low pulse-repetition rates, 
the much longer upper-state lifetime in Ho :YLF gives a distinct advantage. At higher 
pulse rates, the overall laser efficiency and the stimulated emission cross section are more 
important parameters , leading to an advantage for Nd.YAG. The results are of signifi- 
cance for designing lasers for use in space optical communications and remote sensing 
systems. 


I. Introduction 

Diode-laser-pumped solid-state lasers such as neodymium: 
yttrium aluminum garnet (Nd:YAG) and holmium: yttrium 
lithium flouride (Ho:YLF) are prime candidates for laser- 
light sources for use in deep-space optical communications and 
other applications. Due to the long upper-state lifetimes of the 
Nd 3+ and Ho 3+ ions doped into the YAG and YLF crystals, 
the energy storage capacity of such laser materials is quite 
high. Using a technique such as Q switching or cavity dumping, 
the stored energy can be extracted in the form of short laser 
pulses with high peak power. (This form is optimal for direct- 
detection optical communications use.) The pulse-repetition 
rate is controllable using an electro-optical or acoustical- 
optical device for Q switching. 

A laser Q switch is effectively an intracavity shutter-when 
the Q switch is closed, optical pumping continues, but stimu- 
lated emission does not occur. During this time, energy is con- 
tinuously stored in the upper laser level. When the Q switch is 


suddenly opened, the stored energy is released in the form of 
a “giant pulse” of laser light, depleting the upper laser level. 
The next laser pulse occurs only after a sufficient population 
of ions is again placed in the upper laser level. 

Efficient diode-pumped continuous-wave (CW) Nd:YAG 
[1] and Ho: YLF [2] lasers were first demonstrated at JPL. In 
this article, the expected output power and pulse character- 
istics of diode-pumped, Q-switched (pulsed) Nd:YAG and 
Ho:YLF lasers are analyzed. In Section II below, the basic 
theory needed to calculate the results of interest is given. In 
Section III, the factors relevant to a comparison of Q-switched 
lasers are explicitly considered. In Section IV, the results are 
applied to specific laser examples. 

II. Basic Theory 

The most important factor determining the pulse shape and 
pulse energy in a Q-switched laser is the population inversion 
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density 1 of the lasing medium just prior to the opening of 
the Q switch. In this section, we summarize the basic theory 
[3] for determining population inversion, power output, and 
pulse width in a CW-pumped, Q-switched laser. 

In a series of periodic laser pulses, the population inversion 
density of the initial state (i.e., just before a laser pulse), n fy 
depends on the inversion density of the final state (i.e., just 
after the preceding laser pulse), n f , and on the amount of 
pumping that occurs during the period between the laser 
pulses, according to the equation 


The population inversion just after the laser pulse, n f, is 
given 2 by the (transcendental) equation 

n i - «/ = n th In («/”/) ( 5 ) 

This equation, together with Eq. (1) above, determines the 
values for n i and for a given set of conditions. Once the 
change in the population inversion n i -rif is known, the 
energy per pulse is given simply by 

E pulse = {n.-n f ynhvV (6) 


«/ = -(«. o -n f )e 1,Tsf (1) 

where / is the pulse -repetition frequency and r s is the upper- 
laser-level spontaneous decay time. This equation is for con- 
tinuous pumping at a uniform rate. The asymptotic density, 
n ^ , depends on the pumping rate and is the maximum achiev- 
able inversion density (approached when 1// » r s ). The value 
of n ^ can be calculated from knowledge of the CW output 
power, P cw , obtained when the cavity Q is maintained at its 
maximum value: 


The average power output from the laser is then 

P = E . / (7) 

avg pulse J v ' 

The pulse shape is determined by the laser rate equations. 
From those equations, the peak laser power is found [3] to be 

Vhv In 

P = 

peak ] 



n 


oo 


P ew r s 

•qhvV +n '>' 
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Here, hv is the photon energy, and V is the effective lasing 
volume, n th is the threshold inversion density, and 77 is the 
output coupling factor. The latter two quantities are given 
[4] , [5] by 
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for stimulated emission cross section a, length of laser rod /, 
Q-switch maximum single pass transmission £, reflectivities of 
output and rear mirrors, r x and r 2 , respectively, and laser-rod- 
loss coefficient /}. 


where t R is the round trip cavity time. Since t R = 2 Ljc where 
L is the cavity optical path length and c is the speed of light, 
a significant consequence of the equation for P peak is that the 
peak output power of a Q-switched laser is inversely propor- 
tional to the cavity length. 


III. Comparison of Lasers 

Consider first the energy per pulse, E pulse . Use of Eqs. (1), 
(2), and (6) gives 

- r ,s. (9) 

where ft,* = 0 n -n th )!n th and n f = (n th - n f )/n th . The vari- 
ables and fif are normalized versions of their “unhatted” 
counterparts, and have ranges 0 <n' ot> <°°, and 0 < < 1 (i.e., 

the final inversion density is less than the CW laser threshold). 

The exact value of ny can be determined only by solving 
Eqs. (1) and (5) numerically. Alternatively, we note from the 


^he inversion density, is the difference between the population 2 Subject to the approximation that the effect of pumping during the 
density of the upper laser level and that of the lower laser level. laser pulse is negligible. 
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form 3 of Eq. (5) that n f < ri r Since n { < 'n V3 by definition, 
0 < nf<n^ and hence the rightmost factor in Eq. (9) ranges at 
most between 1 and 2. 

Thus, within a factor close to unity, Eq. (9) says that the 
energy per pulse for a Q-switched laser depends only on the 
laser’s CW output power level, the upper-state lifetime for the 
laser, and on the pulse-repetition frequency, and not on other 
factors such as the laser-stimulated emission cross section or 
the passive characteristics of the laser cavity. 


IV. Specific Laser Examples 

Now consider and compare Nd:YAG and Ho:YLF lasers 
characterized by a single-mode CW output power of 100 mW 
and a cavity length of 3.5 cm. Other laser parameters are given 
in Table 1. The most significant difference between the two 
lasers is that the lifetime of the upper laser level, r s , is about 
50 times longer in Ho:YLF than in Nd:YAG. The parameters 
for the 100-mW lasers were chosen to represent the (CW) 
lasers reported in [1] and [2]. Scaling to higher powers is 
considered briefly at the end of this section. 


For high pulse rates / l/r 5 , Eq. (9) reduces to E pulse — 

P cw lf , i.e., effectively the laser’s CW power is collected over 
the pump time 1// and emitted as a short pulse. For low pulse 
rates, the pulse energy saturates as the pumping time becomes 
long compared to the “storage time” r s . In this latter case, 
Eq. (9) reduces to E pulse =*P cw t s . 


Also of significance in comparing two lasers are the respec- 
tive pulse widths. The laser pulse width limits the modulation 
alphabet size, limits the ability to reduce background light by 
narrowing the signal slot width, and (in the extreme) limits the 
maximum achievable Q-switched pulse frequency. The pulse 
shape can be determined exactly only by numerically inte- 
grating the laser-rate equations. Here we calculate only an 
estimate (actually a lower bound) for the laser pulse width 
given by 


pulse 


Epulse 


peak 


( 10 ) 


where E pulse and P peak are given by Eqs. (6) and (8) above. For 
high pulse rates such that / » 1/r and n t - n th <<C n th> this 
reduces to 


pulse 


t R nhvVf 
~2dP 


(ID 


i.e., the pulse width is directly proportional to the pulse rate. 
For low pulse rates,/ <K \/r s , the pulse width approaches a 
constant (minimum), as both the pulse energy and the peak 
power saturate. This constant value is given by 


Am in ) _ E 

1 pulse 21n(l /r { ) rt^ - ln^ + 1) * } 


where wL is defined above. 


Equation (5) can be rewritten as (1 + nj) - ln(l + - (1 - ny) 

- /«(1 - where n f = (n f - n th )ln th . 


Figure 1 shows a plot of the energy per pulse, E pulse , (given 
by Eq. (9) with the factor (1 +ny/?T 00 ) set equal to unity) as a 
function of the pulse-repetition frequency. For both lasers, 
the initial state inversion density saturates as the pumping 
time (1 If) begins to be long compared to the respective upper- 
state lifetimes. The saturation value of the pulse energy is 
(t s X 100 mW); hence, for low pulse rates, the Ho:YLF laser 
pulses are about 50 times larger than those for Nd:YAG. For 
high pulse rates, the pulse energies become asymptotically 
equal. (At intermediate pulse rates, the Ho:YLF pulse energy 
is always larger.) 

Figure 2 shows the laser average output power, P , versus 
frequency. The average power goes to zero for low repetition 
rates, and approaches the CW laser power (100 mW) at high 
pulse-repetition rates. Again, values are always higher for the 
Ho:YLF laser. 

Figure 3 shows the estimate on the laser pulse width given 
by Eq. (10). At low pulse-repetition rates, the Ho:YLF pulse 
width is about three times, smaller than the Nd:YAG pulse 
width, due mainly to the effect of a higher n , « for the Ho: YLF. 
For higher pulse rates (above ~ 10 3 per second), the pulse 
widths are controlled by Eq. (11); here, since hv and 1/a are 
smaller for Nd:YAG than for Ho:YLF, the Nd:YAG pulse 
width is smaller by a factor of about 4. In the frequency range 
of 10 s to 10 6 per second (upper right corner of Fig. 3), the 
calculated lower bound on the pulse width for both lasers 
exceeds the pulse interval l//-the laser will not operate in a 
simple Q-switched pulse mode at those frequencies. 

Now consider a simple scaling example-two 1-W lasers. 
(Laser parameters are the same as those given in Table 1, 
except P cw = 1 W.) It can be seen from Eq. (9) that E pulse scales 
linearly with P cw (subject to the validity of the approximation 
Hf « n' <x> ). Therefore, values of E pulse (and P avg ) for 1-W 
lasers are a factor of 10 larger than those for the 100-mW 
lasers shown in Figs. 2 and 3. Values for t pulse are smallerfor 
the 1 -W lasers than those shown in Fig. 3 for 100-mW lasers. For 
high pulse rates, Eq. (1 1) gives a reduction in pulse width by a 
factor of 10. At lower pulse rates, the reduction is not a linear 
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factor-calculated values of for the 1-W lasers are 

1.6 ns and 3.4 ns for HoiYLF and NdiYAG, respectively. 

V. Conclusions 

The Ho:YLF laser shows promise for use in low-data-rate 
optical communications systems, such as would likely be used 
in an Earth-to-spacecraft uplink in an all-optical communica- 
tion system. At pulse repetition rates below 10kHz,Ho:YLF 
offers higher performance than Nd:YAG in terms of higher 
energy per pulse for similar CW lasers. The upper-state life- 
time, t s , is the dominant parameter in determining Q-switched 
pulse energy in this regime. Below 1 kHz, the HoiYLF also 
offers shorter laser pulse widths. The large value of r s makes 
it easy to pump HoiYLF far above threshold, resulting in 
short pulse widths. At higher pulse rates, r s becomes less 


important in determining the pulse energy. In the extreme, the 
pulse energy approaches P cw lf \ and hence for two lasers with 
equal input (pump) powers, the pulse energies depend only on 
the relative CW laser efficiencies. In this regime, NdiYAG 
offers shorter pulses, due mainly to its higher stimulated emis- 
sion cross section, a. 

Higher CW power lasers lead to proportionately higher 
pulsed-mode output powers, and also to shorter laser pulse 
widths. Hence for communications systems, the benefit of 
higher-power lasers comes not only from basic transmitter 
power considerations, but also from the ability to limit back- 
ground noise by using shorter communications slot widths. 
An understanding of the parameters affecting pulsed laser 
operation is important in designing communications lasers 
for maximum system performance. 
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Table 1. Parameters for Nd:YAG and Ho:YLF lasers of Section IV 


Parameters 

Nd:YAG 

Ho:YLF 

P cw , mW 

100 

100 

v ms 

0.23 

12 

/um 

1.064 

2.06 

F, cm 3 

0.02 

0.02 

/, cm 

1.0 

1.0 

L, cm 

3.5 

3.5 

r \ 

0.975 

0.95 

r 2 

1.00 

1.00 

$ 

0.987 

0.987 

p, cm' 1 

0.0023 

0.0023 

cr, cm 2 

8.7 X 10' 19 

1.0 X 10' 19 


c -> 
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Fig. 1. Output energy per pulse, £ pu/se , in mJ, versus pulse 
repetition frequency, f, in Hz, for Nd:YAG and Ho:YLF lasers 
described in Table 1. The curve marked Nd:YAG (XI 0) is the 
Nd:YAG curve multiplied by a factor of 10 for clarity. 



Fig. 2. Average output power, P avg , versus pulse repetition 
frequency, f, for lasers of Table 1. 



Fig. 3. Laser pulse width, f /sa , versus pulse repetition frequency, 
f, for lasers of Table 1. 
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For deep-space pulse -position modulation (PPM) optical communication links using 
Nd:YAG lasers , two types of laser transmitter modulation techniques are available for 
efficiently producing laser pulses over a broad range of repetition rates: Q-switching and 
cavity dumping. The desired modulation scheme is dependent on the required pulse 
repetition frequency and link parameters. These two techniques are discussed , theoretical 
and numerical calculations of the internal energy of the laser cavity in cavity dumping are 
described , and an example of cavity dumping is applied to a link for a proposed experi- 
ment package on Cassini. 


I. Introduction 

A link -analysis approach is a standard aspect of the develop- 
ment and design of a communications system. It is essential to 
have confidence that the component performances assumed in 
these link calculations are realizable. Because some of the key 
components in optical communications are still in the develop- 
ment phase, it is necessary to use theoretical analyses to sup- 
port the performance assumptions made in the link studies. 

One of these key components is the laser transmitter. The 
laser most likely to be used is a neodymium-doped yttrium 
4 aluminum garnet (Nd:YAG) crystal end -pumped by laser 
, diodes. In contrast to flash-lamp pumping, laser diodes can 
provide pump light at one of the atomic resonant absorption 
bands of the Nd 3+ ion to improve pumping efficiency. (It is 
the rare-Earth ion Nd 3+ in the Nd:YAG that lases. The YAG 
is simply the host matrix.) By mode-matching the pump light 
into the laser cavity, the absorption of pump photons is made 


to occur exactly where the lasing takes place, and the absorp- 
tion length is greater than in the common side-pumping geom- 
etry. With this architecture, overall electrical-to-optical effi- 
ciencies in excess of 10% have been demonstrated [1] . 

In order to implement deep-space optical communications, 
the extremely energy-efficient pulse-position modulation 
(PPM) scheme will be used. This modulation format puts 
severe demands on the performance of the laser transmitter, 
and it is very important to verify that the required performance 
that has been assumed in link calculations is achievable. In 
this article we report on progress in our understanding of the 
behavior of a modulated laser used for deep-space communi- 
cations. Two regimes of modulation, Q-switching and cavity 
dumping, are discussed, and a study of a laser performing in 
the latter mode follows. Although this study does not model 
cavity dumping completely, it does provide valuable insight 
into the process. We evaluate the buildup of energy prior to 
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the emission of a signal pulse. This is a necessary step to insure 
that the energy required for deep-space optical communica- 
tions is available in the laser cavity using realistic system 
parameters. The cavity-dumping analysis here is applied to an 
example of an optical communications link from Cassini dur- 
ing its interplanetary cruise. A detailed analysis of Q-switching 
will be presented in a future report. 


II. Modulation Techniques and Applications 

Two of the common techniques for achieving high-peak- 
power pulses from the Nd:YAG lasers are Q-switching and 
cavity dumping. In Q-switching, the energy is stored in the 
atomic population inversion by keeping the Q of the cavity 
too low to support laser oscillation. This is accomplished with 
the use of an element in the cavity whose loss can be controlled. 
Atoms are pumped to the upper state, but in the absence of 
stimulated emission, the upper-state population will be greater 
than in the equilibrium condition achieved when lasing occurs. 
When the Q is increased (by reducing the loss), the energy in 
the atoms is immediately available, and the stimulated-emission 
rate becomes large. A high-energy pulse then depletes the 
upper level, and lasing temporarily ceases. If the Q is reduced 
at that point, the pump energy will again begin accumulating 
population in the upper state. Q-switching has an upper limit 
imposed by the finite time required to repump the population 
inversion and by the cavity-field buildup time [2] . A pulse- 
repetition frequency (PRF) on the order of 50 kHz is the 
maximum value that can provide high-peak-power pulses from 
Q-switched Nd: YAG. 

For pulse rates much higher than 50 kHz, the technique of 
cavity dumping is preferred. Although cavity dumping can be 
extremely efficient at frequencies of many megahertz, it is 
less efficient at low pulse-repetition rates. As PRFs increase, 
the choice between Q-switching and cavity dumping will 
depend on specific laser design parameters and link require- 
ments. The optimal transition point will be understood after 
further study. In cavity dumping, instead of storing the energy 
in atoms, the energy is stored in the photon field of the cavity. 
The output-coupling strength is varied so that the energy in 
the cavity is extracted when it is needed. The laser is kept 
above threshold during the entire process. 

Both of these modulation techniques have application to 
deep-space optical communications. Examples of specific 
optical links between a planetary spacecraft and Earth-based 
receivers will illustrate this. We have proposed to include an 
optical-communications package on Cassini. Second in the 
series of Mariner Mark II spacecraft, it will be targeted for 
Saturn orbit and will release a probe into the atmosphere of 
Titan. Currently, launch is expected in 1996. Although the 


prime communications system will use radio-frequency tech- 
nology, there may be an opportunity to include an optical 
communications experiment package to prove out its tech- 
nology, increase the data return rate, and perform a number of 
“light science” experiments which take advantage of the on- 
board laser, telescope, and other optical components. 

One configuration of the Cassini optical package uses a 
30-cm telescope for the transmit/receive antenna. A frequency- 
doubled Nd:YAG laser with an average power of 1 W would 
serve as the transmitter. Transmitting to a 10-m Earth-based 
receiver under clear skies [3] , this system could return over 
115 kb/sec from 9 AU. This includes Saturn being in the 
field of view of the receiver, and the calculated link margin is 
3 dB [4]. To achieve this impressive performance, a PPM 
alphabet size of M=256 is used, and the width of each slot is 
10 nsec. With the use of coding, the bit error rate is 10“ 5 . 

Because PPM with M=256 transmits 8 bits per pulse, a data 
rate of 115 kb/sec requires 14,375 pulses per second. The 
duty cycle is obviously quite low, the laser being on for a 
total of only about 144 Msec each second. The dead time 
between the 256-slot words is 67 Msec. This mode of operation 
is comfortably in the Q-switch regime. With an average laser 
power of 1 W, each of the pulses has a peak power of almost 
7 kW. 

During interplanetary cruise, Cassini may be used to demon- 
strate much higher data rates. Using 256-ary PPM with 10-nsec 
slot widths and about 2.6-Msec dead time between words, the 
optical communications package could return 1.54 Mb/sec 
from 5 AU (the distance of Jupiter) with a 3-dB margin. This 
does not assume Jupiter to be in the field of view. To transmit 
1.54 Mb/sec with M=256 PPM, the laser is required to emit 
192.5 kilopulses per second. To maintain 1 W average power, 
each pulse requires a peak power of 519 W. Based on our 
present understanding, maximally efficient performance at 
this PRF necessitates the use of cavity dumping. 

These examples illustrate the importance of both Q-switch- 
ing and cavity dumping for deep-space optical communica- 
tions. Detailed understanding of laser performance under both 
operating conditions is essential. In the following section, we 
present calculations of a laser using cavity dumping to achieve 
the higher Cassini data rate from 5 AU. 


III. Analysis and Calculations of Cavity 
Dumping 

It is important for us to understand the details of the 
behavior of a Nd:YAG laser operating in a cavity-dumping 
mode in order to make accurate predictions of its performance 
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and design an efficient system. Following the work of Chesler 
and Maydan [5] , we can calculate the approximate perfor- 
mance of a Nd:YAG laser on Cassini at 5 AU as discussed 
above. These calculations describe the population inversion 
and internal field of the laser during buildup in preparation for 
emitting an output pulse. This initial approach to modeling 
cavity dumping does not include frequency doubling or the 
output pulse generation and its characteristics. But we shall see 
that it does allow us to determine and verify some important 
aspects of the laser performance. Chesler and Maydan begin 
with the rate equations for a continuously pumped laser: 

= R - TN- $FN (la) 

and 

= HFN-(e + T)F (lb) 


For a given cavity design, T will be fixed. It can be shown by 
maximizing the output power that the optimum cw values for N 
and F , given fixed T , are7V 0 = e0 1 / 2 /0 and F 0 = T(0 1 / 2 - l)/jj. 
The parameter 0 is defined to be Rfi/Te, which is the ratio of 
the pumping rate to the threshold pumping rate. Of course, 
in storing and dumping the energy in the cavity, the interest 
is in the deviations from the cw performance. Thus, we in- 
troduce n and / to describe these deviations, and we have 
N = N 0 + nN 0 and F = F 0 + /F 0 . 

Chesler and Maydan make a number of reasonable approxi- 
mations to arrive at expressions for these deviations. One of 
the key assumptions is that the duration of an output pulse is 
short compared to the buildup time between pulses. In our 
example, this is seen to be an excellent assumption, since the 
pulse duration of 10 nsec is less than 0.4% of the minimum 
time between pulses. The approximate solutions are found 
to be 


In these equations, TV is the number of atoms in the upper laser 
level; F is the number of coherent photons in the cavity; t is 
time; R is the number of atoms pumped up per second; T is 
the spontaneous-decay rate of the upper laser level; and $FN 
is the number of atoms per second undergoing stimulated 
emission. The stimulated emission coefficient 0 may be ex- 
pressed as co/AL , where a is the laser transition cross section, 
A is the cross-sectional area of the laser beam in the Nd:YAG 
rod, and L is the optical length of the cavity, e = cA/2Z, is 
the reciprocal of the cavity decay time (not including losses 
from intentional output coupling), where A is the round-trip 
fractional inherent cavity loss. Similarly, T = ca/2L is the 
reciprocal of the cavity decay time (including only inten- 
tional output coupling), where a is the fractional output 
coupling. During the buildup phase of the cavity dumping 
cycle, a = 0. In order to extract a pulse, in the ideal case, the 
value of a would be changed to 1, thus allowing 100% of the 
stored energy to be emitted in a pulse. In reality some losses 
will be incurred in this process, but this analysis considers 
only the internal energy of the laser cavity. 


n 


yr/l 1 + s e rs/T \ 
6 V 2 T ev-l) 


and 


e 7 - 1 

where s = fe, or the time in units of the cavity decay time; 
r/e is the time between pulses, during which the field intensity 
accumulates; and y = r(0 1/2 - 1). 

With these expressions, we can calculate the evolution of 
the upper-state population and the optical field for cavity 
dumping in the regime of validity for these solutions. Because 
the development began with rate equations, the results do not 
apply when the number of coherent photons in the cavity is 
reduced to the order of one. At this level, the statistics of 
spontaneous emission control the buildup of the field, and the 
rate-equation approach is not appropriate. 


These two equations can be understood by considering the 
physical processes involved in laser physics. Equation (la) 
describes the time dependence of the atomic population inver- 
sion. The inversion is increased by atoms being pumped up to 
the upper laser level by the pump source, and it is diminished 
by both spontaneous and stimulated emission. The latter 
effect provides a positive contribution to the photon field in 
the cavity, and that is reflected in the first term on the right 
side of Eq. (lb). This equation describes the time dependence 
of the number of photons in the field of the cavity. The 
second term in that equation reflects the loss of photons 
through inherent and intentional losses in the cavity. 


Our interest now is in finding N/N Q and F/F 0 . We con- 
sider the case of a cavity with inherent loss A = 0.03 and a 
length L = 20 cm. These combine to give a cavity decay time 
of e = 44 nsec. (Recall that e does not include intentional 
output coupling. By increasing the output coupling, a, when it 
is time to emit the energy, 10-nsec pulses can be achieved. The 
technique used in this report to examine cavity dumping does 
not allow us to study the output pulse.) To calculate the 
parameter j3, we use o « 5.75 X 10~ 23 m 2 for Nd:YAG [6], 
and A = 3.14 X 10" 6 m 2 . Thus we find 0= 2.74 X 10~ 8 Hz. 
The fluorescence lifetime of the upper state in the Nd:YAG 
laser line is 230 /isec, so T = 4350 Hz. 
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Using the output coupling which comes from the optimum 
cw values for N and F as outlined above, we can calculate a 
pumping rate which guarantees an average power of 1.0W. 
This turns out to correspond to a pumping rate above thresh- 
old of 0 = 4.9. We know this is achievable, since this value of 
0 is less than that previously demonstrated for diode pumping 
of Nd:YAG lasers [7]. 

With these values, we determine n and/and thusA7iV 0 and 
F/F q as functions of time. The results of these calculations are 
shown in Figs. 1 and 2. 

At time 0 in both figures, the system is beginning just after 
a pulse has been produced. The pump energy is building up 
population in the upper level of the laser line and contributing 
to the field energy. When the field energy passes the cw opti- 
mum value of F - F 0 (Fig. 2), the rate of stimulated emission 
becomes large enough to begin reducing the population in the 
upper state. The upper-state population (N) begins to decline 
(Fig. 1), and it never varies significantly from the cw value. 
When the inversion decreases, energy is transferred into the 
optical field by stimulated emission until the designated time 
to dump the cavity. When it is time to produce a pulse, the 
output coupling (a) is changed, and the internal-field energy 
drops as it is emitted in the narrow pulse. The greatly reduced 
internal field causes a reduction in the stimulated-emission 
rate, so the population inversion begins to increase again and 
the entire cycle starts over. 

From our calculations of the laser performance, we find 
that the cavity accumulates 5.17 fxJ at the maximum. It is at 
that point that the pulse is produced by changing the output 
coupling. Although the approach used here does not address 
the dynamics of the output signal, if we assume that all of 
this available energy is emitted in a IG-nsec pulse, it produces a 
peak power of 517 W. This is within less than 1% of the values 
derived from the Cassini link calculation and is achieved with 
the laser component values we have used. 


IV. Conclusions 

Within the limitations of this initial approach to under- 
standing cavity dumping, we can see that the performance 
assumed for the laser transmitter in the optical link calcula- 
tions is justified. Realistic laser parameters with an achievable 
pumping rate will lead to production of the stored energy 
needed for the Cassini link from Jupiter. 

A detailed understanding of the laser operation during 
cavity dumping is crucial to the design of a laser capable of 
providing the signals needed for the pulse-position modulation 
to be used in our optical-communications system. The approx- 
imate solutions used here provide a starting point for that 
understanding, but more needs to be done. One of the assump- 
tions of the derivation is that the dumping is periodic. Of 
course, since the transmitted information is contained in the 
time during which the pulse is transmitted, varying times 
between pulses must be considered. This would allow the field 
energy and population-inversion energy to continue to evolve 
for different lengths of time between signals. A more detailed 
analysis of this factor would reveal exactly how it affects the 
uniformity of the output pulses. In addition, a study of pulsed 
pumping would be necessary in order to insure that the stored 
energy is maximum just before the output coupling is raised to 
release that energy as an output signal. Further, to achieve 
still higher data rates, operation in a regime where the time 
between pulses is not large compared to the pulse width is 
required, as has been assumed here. For Cassini at Mars range, 
data transmission of 20 Mb/sec is planned. To achieve that rate 
with 10-nsec pulses will require a dead time of only 40 nsec 
and an alphabet size of 16. An analysis of the performance of 
the laser transmitter under these conditions requires use of the 
exact solution. Such an analysis should include the actual 
extraction of the pulse to reveal its characteristics in detail. A 
careful comparison of Q-switching and cavity dumping in the 
PRF range where they overlap will allow the determination of 
the preferred scheme of modulation under different link 
scenarios. 
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Fig. 1. Normalized population inversion as a function of time. 



Fig. 2. Normalized field as a function of time. 
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Optical reception antennas (telescopes) must be capable of receiving communications 
even when the deep-space laser source is located within a small angle of the Sun (small 
solar elongation ). Direct sunlight must not be allowed to shine on the primary reflector 
of an optical reception antenna , because too much light would be scattered into the signal 
detectors. A conventional sunshade that does not obstruct the antenna aperture would 
have to be about five times longer than its diameter in order to receive optical communi- 
cations at a solar elongation of 12 degrees without interference. Such a long sunshade 
could not be accommodated within the dome of any existing large-aperture astronomical 
facility , and providing a new dome large enough would be prohibitively expensive. It is 
also desirable to reduce the amount of energy a space-based large-aperture optical recep- 
tion facility would expend orienting a structure with such a sizable moment of inertia. 

Since a large-aperture optical reception antenna will probably have a hexagonally seg- 
mented primary reflector , a sunshade consisting of hexagonal tubes can be mounted in 
alignment with the segmentation without producing any additional geometric obstruction. 
The tubes can be extended downward toward the primary reflector , until they reach the 
envelope of the focused beam to the secondary reflector. If the optical reception antenna 
is ground-based , the other ends of the tubes may be trimmed so that both the sunshade 
and the antenna will fit within a sphere whose diameter is only six-fifths the diameter of 
the primary reflector : If the segmentation involves four rings of hexagons with the cen- 
tral segment absent from the primary , then this sunshade is useful when solar elongations 
are as small as 12 degrees. Additional vanes can be inserted in the hexagonal tubes to per- 
mit operation at 6 or 3 degrees. The structure of the sunshade is very strong and can be 
used to support the secondary reflector instead of an independent support. 

An analysis of the duration and recurrence of solar-conjunction communications out- 
ages (caused when a deep-space probe near an outer planet appears to be closer to the Sun 
than a given minimum solar elongation ), and the design equations for the integral sun- 
shade are appended. 
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I. The Need for Sunshading 

Direct-detection optical communication at visible wave- 
lengths from laser sources on deep-space probes requires that 
background interference be reduced to acceptable levels. Back- 
ground from natural sources is usually incoherent and can be 
reduced substantially by narrowband filtering. Additional 
immunity to interference is achieved by sending an optical 
pulse only during one time slot of a series of time slots. After 
filtering, the remaining background level must be low enough 
to ensure an acceptably small probability that the background 
count in any empty time slot will be less than the background 
plus signal count in the signal time slot. 

For optical communication to a deep-space probe near an 
inner planet, the background interference may be so high that 
heterodyne detection techniques are required. These reduce 
background by using a post-detection filter whose bandwidth 
is much narrower than the bandwidth of any pre-detection 
optical filter. The spatial coherence of the signal must be pre- 
served, however, which certainly requires good optics and may 
preclude reception through the Earth’s atmosphere. 

In studies of typical missions to outer planets it has been 
shown that the background is acceptably small for direct- 
detection optical communications even when the sunlit planet 
fills a substantial field of view behind the spacecraft ([1], 
Appendix A). The sunlight to be excluded by the sunshade is 
then direct sunlight scattered within the reception antenna 
(telescope) when the planet is near conjunction, i.e., at small 
Sun-Earth-probe angles (small solar elongations). 

A. Scattering from Rough Reflectors 

Recommended plans for the Optical Reception Develop- 
ment Antenna [2] call for use of a hexagonally segmented 
primary reflector made up of light-weight, composite panels 
having a root-mean -square surface roughness of 2{x m. 1 It is 
anticipated that sunlight directly incident on such a surface 
would produce intolerable scattering into the detectors, no 
matter what internal sunshades were used. Therefore, a pri- 
mary sunshade must be provided, capable of shading the 
primary reflector from direct sunlight whenever the antenna 
is used at more than the design minimum solar elongation. 

B. Thermal Effects on Visible Reception Antennas 

Sunlight will not heat sunshades to incandescence; there- 
fore reradiation from the sunshade will not be a problem for 
visible reception (even though it is a problem for infrared and 


1 P. N. Swanson, A Lightweight Low Cost Large Deployable Reflector 
(LDR), JPL Publication D-2283 (internal document). Jet Propulsion 
Laboratory, Pasadena, California, pp. 5-1-5-6, June 1985. 


millimeter-wave telescopes). Thermal distortion of the struc- 
ture and convection currents or heat extraction difficulties 
are perennial problems deserving further study. 

C. Communications Outages Near Solar Conjunction 

The orbits of most of the planets lie close to the same 
plane, the plane of the ecliptic, which is the plane of the 
apparent path of the Sun through the sky and also the plane 
of the Earth’s orbit. This means that the planets appear to 
approach the Sun as they are viewed from Earth. The times 
when the planets are close to the Sun are called conjunctions. 
The duration r d of a conjunction depends on the periods of 
revolution t of the Earth and T of the outer planet, and on 
the design minimum solar elongation E (the minimum solar 
elongation for communications is the maximum solar elonga- 
tion of the conjunction that causes the communications out- 
age). The period of recurrence r r of the conjunction depends 
on t and T. 

Communications with a probe on a mission to an outer 
planet will be blocked whenever the planet appears to come 
too close to the Sun, as in Fig. 1. Table 1 shows the duration 
of outages for various limiting solar elongations, and the 
period of recurrence, for the outer planets. The formulas on 
which the table was based are derived in Appendix A. 

The inclinations i of the orbital planes of each of the outer 
planets with respect to the Earth’s orbital plane are also given 
in Table 1. The line of intersection of the orbital planes of a 
pair of planets is called the line of nodes. Only Pluto’s orbit is 
highly inclined. This means that Pluto approaches a close con- 
junction only during the times when the Earth and Pluto are 
close to opposite ends of their line of nodes. At other times 
Pluto appears to pass at a variable angle (as much as 17 degrees) 
north or south of the Sun. The other outer planets move in 
orbital planes too close to the Earth’s to help much in reliev- 
ing the communications interference encountered near solar 
conjunction. 

II. Disadvantages of Conventional 
Sunshades 

The usual primary lightshade of a telescope is an internally 
blackened tube extending from the primary reflector to a 
short distance beyond the primary focus. This provides shad- 
ing for the secondary reflector also, in a Cassegrain or New- 
tonian arrangement. Other internal baffles may be added. If a 
sunshade is needed, it is usually added as an extension of the 
primary lightshade beyond the primary focus. Sometimes this 
extension is cut down to a sun visor in order to reduce weight, 
though that requires the operation of orienting the telescope 
axially, relative to the Sun. 
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The chief disadvantage of an extended primary lightshade 
is the length required in order to look at targets when the solar 
elongation is small, without allowing light to strike the pri- 
mary reflector. The length required is D cot E. If the telescope 
diameter D is 10 m and E - 12 degrees, the length is 47 m, 
which makes a very unwieldy telescope. 

A series of slats or flat plates may be inserted within the 
tube, such that the normal to the plates is perpendicular to 
the line of sight. If the slats divide the diameter into n spaces 
of equal thickness between them, then the overall required 
sunshade length is divided by n. The number n cannot be 
made very large, however, for two reasons. First, the slats must 
be made of some material having a finite thickness, and the 
sunshade cannot be kept in perfect alignment with the line of 
sight. This means that the slats will obstruct the view to some 
extent, contributing some fraction to the opacity of the tele- 
scope. Second, the slats will introduce additional diffraction 
and spread the image of the deep-space laser source. The field 
stop will then have to be opened to capture a reasonable frac- 
tion of the incoming signal power. If a planet or another 
extended object is in the background, the background level 
will increase as the field stop is opened and the performance of 
the communications link will be degraded. 

A. End-Mounted Sunshades 

The largest telescopes, used for astronomy, do not have 
sunshades associated with them. Astronomers use the Earth 
as a natural sunshade by doing their observing at night. This is 
necessary because the natural objects they look at emit inco- 
herent radiation which is too faint to be separated from the 
scattering in the daytime blue sky. 

The dome is the most expensive component of an observa- 
tory building, and the cost increases faster than the square of 
the diameter. (The log-log graph in Fig. 2 has a best-fitting 
slope of 2.02 for standard, electrically-driven, hemispherical 
domes from 3 to 1 1 m in diameter. The 37-m Keck dome is 
custom-made, more than hemispherical, and has special drives 
and sensors for precise positioning. The slope from the largest 
standard dome to the Keck dome is 3.81.) For this reason, 
among others, the dome is usually made only large enough 
to clear the swing sphere swept out by the motion of the 
telescope. 

1. Ground-based antennas. If an astronomical observa- 
tory with a large-diameter telescope were converted or rented 
for use as a ground-based optical reception station, an end- 
mounted sunshade of a reasonable length could not be accom- 
modated within the dome, for the reasons stated above. 

2. Space-based antennas. A space-based optical reception 
antenna could conceivably be sunshaded by a tube (or visor or 


even a flat plate) whose length was about five times the diam- 
eter of the antenna. Ingenious methods could be devised to 
transport such a structure to space, erect, and assemble it. 
However, the moment of inertia would be very large. A great 
deal of energy would be expended in orienting the telescope 
and sunshade while tracking a deep-space probe. 

B. Externally Mounted Sunshades for Ground-Based 
Reception 

Since most large telescopes are protected by a dome, it 
would be possible to mount a long tube externally on the 
dome. Alignment of the tube with the telescope is necessary 
but is not required to be very precise. A small computer could 
easily control the azimuth of the dome and the elevation angle 
of the sunshade as well as the azimuth and elevation of the 
telescope, when tracking an object and compensating for the 
rotation of the Earth. 

However, most observatory sites are on mountain peaks 
where they are subject to occasional high winds. Table Moun- 
tain Observatory, for example, reports clocking winds at 
90 m/sec (200 mph), which their domes survive. An exter- 
nally mounted sunshade would add considerable wind load to 
the dome. The dome would have to be strengthened for opera- 
tion in moderate winds, and the sunshade would have to be 
stowed securely whenever high winds or inclement weather 
was anticipated. 


III. Solution: The Integral Sunshade 

An integral sunshade for optical reception antennas is pro- 
posed to overcome the disadvantages detailed above. Figures 3 
and 4 are photographs of a glue-and-paper model of the inte- 
gral sunshade. The sunshade consists of a bundle of closely 
packed hexagonal tubes, aligned with the antenna line of sight, 
forming a structure that supports the secondary reflector. 
(This makes the sunshade an integral part of the telescope 
structure.) The outer edges of the outermost tubes extend 
around the primary reflector and form the primary sunshade. 
Inside, the tubes are cut off just short enough to provide 
clearance for the focused beam to the secondary reflector. The 
opposite ends are trimmed in the form of a spherical cap, to fit 
within the swing sphere of the telescope. 

A plan view of the reflector is shown in Fig. 5. The axial 
hexagons are those that straddle the x-axis. The central hexa- 
gon is numbered (0,0). The other hexagons are numbered first 
by their ring number (starting with the innermost) and then by 
their sequence number within the ring (starting with the axial 
hexagon). The numbered hexagons all have different surface 
figures in order to fit together into a parabolic reflector. The 
points of individual hexagons have been lettered A, B, C, D, E, 
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and F, counterclockwise starting with the point closest to the 
60-degree line. This lettering is illustrated for hexagons (1,1) 
and (3,3). 

The other ring hexagons are symmetrical with respect to 
rotations of 60 degrees. The axial hexagon on the outermost 
ring, (4,1) in Fig. 5, is called a corner hexagon. 

Figure 6 shows the integral sunshade concept as it would be 
for a Cassegrain optical reception antenna having a 10-m,//0.5, 
hexagonally segmented, four-ring, primary reflector. An x-z 
cross-section and a y-z cross-section are shown, each covering 
only the positive half of the x - or .y-axis. The reflector lies at 
the bottom of, and is tangent to, a 12-m-diameter swing sphere. 
The secondary reflector is the same size as the absent central 
hexagonal panel of the primary reflector. 

Another baffle surrounds the hole in the primary left by 
the absent central hexagon. This baffle consists of the frus- 
tum of a six-sided pyramid cut off by the intersection of two 
planes to form each edge. One plane is determined by the edge 
of the central hexagon and the primary focal point. The other 
is determined by the edge of the secondary reflector and the 
secondary focal point. The pyramid is illustrated in each cross- 
section of Fig. 6 by a line from the primary reflector slanting 
inward. 

Within the pyramid is a conical (or cylindrical) baffle de- 
signed to capture rays that enter the pyramid after passing 
through the innermost ring tube. This baffle is illustrated in 
each cross-section of Fig. 6 by a line from the center of the 
primary reflector slanting outward. 


A. Design Premises 

1 . Sunlight may not be allowed to shine anywhere on the 
primary reflector. Premise 1 could be violated, of course, by 
looking at a deep-space probe at less than the allowed solar 
elongation. Sunlight would not flood the entire primary until 
the solar elongation was equal to half the solar subtense. A 
small amount of sunlight scattering from the primary might 
be tolerable, depending on the parameters of the optical com- 
munications link. However, Premise 1 is used to define the 
minimum solar elongation for normal operation. 

2. A ray of sunlight is considered to be stopped when 
incident, however obliquely, on the blackened surface of any 
part of the sunshade. Premise 2 does not require the existence 
of a perfect absorber with no forward scattering. It only means 
that the absorption is adequate to reduce the interference 
caused by the remaining forward-scattered sunlight to toler- 
able levels. 


3. The sunshade parts and reflecting surfaces are consid- 
ered to be infinitesimally thin. Premise 3 actually renders the 
design conservative. In practice it is expected that there will 
be gaps of about 2 cm or so between the panels of the primary 
reflector. The walls of the hexagonal tubes will not have to be 
nearly so thick to form a very strong structure. The absorption 
of obliquely incident rays on their surfaces may therefore be 
improved by adding ridges or ring baffles consisting of thin 
plates cut to fit perpendicularly within the tubes, having a 
hexagonal hole punched in them the same size as the reflecting 
panel below. A light ray incident on the tube wall just above 
such a ring would experience two geometrical reflections, one 
from the tube wall followed by one from the ring baffle, such 
that the ray would actually be reflected back parallel to itself, 
as illustrated in Fig. 7. 

Ring baffles effectively reduce the chords across the tubes 
without introducing any additional geometrical obscuration of 
the telescope aperture beyond that produced by the segmenta- 
tion. Reduction of the chords without changing the lengths of 
the tubes means that the sunshade could be used at solar elon- 
gations slightly less than the minimum. 

4 . The truss supporting the primary reflector and the op- 
tics behind it can all be fitted in the space between the pri- 
mary reflector and the swing sphere. If Premise 4 cannot be 
fulfilled in fact, the dome will have to be made with a some- 
what larger clearance. 

The design equations are set forth in Appendix B. 

B. Design for Operation Within a Minimal Dome 

During the initial conception of this design it was observed 
that the top ends of the hexagonal tubes follow a curve that 
parallels, to some extent, the cone of the focused beam from 
the primary to the secondary. For an // 0.5 primary the paral- 
lelism is good when the diameter of the swing sphere is six- 
fifths of the diameter of the primary. This allows operation 
within a dome that fits very closely over the optical reception 
antenna. Such a dome and sunshaded telescope are illustrated 
in Fig. 8. The dome opening is far larger than that of conven- 
tional domes with meridional shutters, however, and the dome 
must be much more than hemispherical if the telescope is to 
be able to look horizontally or down to some minimal eleva- 
tion angle. 

The two shortest sets of tubes are then the tubes on the 
innermost ring, and those on the corners of the outermost 
ring. Rays entering along the longest chords within the inner- 
most tubes are stopped by the pyramid, however. The mini- 
mum solar elongation is equal to the arcsine of the ratio of the 
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longest chord to the shortest distance through the corner tube 
on the outermost ring from top to bottom perimeters. The 
longest chord is between the points of the tube. The shortest 
distance goes from either of the top outermost points to the 
opposite bottom inner point. 

In the design shown in Figs. 3 through 6 the minimum solar 
elongation E is 12.44 degrees. Allowing for ring baffles 1 cm 
wide reduces E to 1 1 .96 degrees. 

The minimum solar elongation may be cut approximately 
in half by inserting a set of plates or vanes between the points 
of each tube, so the cross section resembles an asterisk inscribed 
within a hexagon (Fig. 9a). The plates would run the length of 
the tube, and would be cut off at the ends to fit the primary 
focused beam and the swing sphere, just as the tubes are. This 
effectively subdivides the hexagonally segmented aperture into 
equilateral triangles. This time additional obscuration and dif- 
fraction are introduced. 

Reduction of the minimum solar elongation to one quarter 
can be accomplished by subdividing each of the equilateral 
triangles again with plates. The cross section resembles a six- 
pointed star superimposed over the asterisk and inscribed 
Within a hexagon (Fig. 9b). This structure would be stable 
without the circumscribing hexagon (unlike the asterisk struc- 
ture). The tubes could be designed with channels running the 
length of each point, and the six-point-star-and-asterisk vanes 
could be inserted in each tube whenever it was necessary to 
track an object at close solar conjunction. The vanes could be 
removed whenever operations did not require looking closer 
than 12 degrees of the Sun to eliminate the obscuration and 
diffraction the vanes cause. 

C. Application to a Space-Based Reception Antenna 

A space-based antenna would not have to swing within a 
prescribed sphere. However, the integral sunshade has a moment 
of inertia that is considerably smaller than that of an open 
tube providing similar sunshading, and the center of gravity is 
much closer to the primary focal point. These factors favor use 
of the integral sunshade for space -based optical reception an- 
tennas operating in the visible region of the spectrum. 

The structure of the sunshade is very strong and rigid. It 
may be used to mount the secondary reflector at a fixed dis- 
tance from the primary. The mass and added diffraction of a 
secondary-reflector support spider are eliminated. 

D. Summary of Design Advantages 

A very compact, manageable sunshade is provided. No geo- 
metrical obscuration not introduced already by segmentation 


of the primary reflector is added. The amount of diffraction 
added by the sunshade is very small. 

The swing sphere for the entire system is only slightly larger 
than the sphere needed to swing the primary reflector. No 
wind loads are added to the dome. Dismounting and stowing 
an external sunshade for anticipated inclement weather are 
not required. 

The center of the swing sphere is placed very close to the 
primary focal point. (A small adjustment of the focal length of 
the primary reflector would make the two points coincide, if 
that were desirable.) The design provides a rigid structure to 
support the secondary reflector above the primary. The mass 
and added diffraction of a secondary support spider are 
eliminated. 

E. Possible Extensions of the Design 

The minimum solar elongation is set by skew rays through 
the tubes on the corners of the outermost ring. It is possible 
to reduce the minimum solar elongation by adding some small 
additional baffles. 

One method would simply cap a portion of the outer top 
edge of the corner tube whenever the telescope is used at a 
small solar elongation. The area of the effective aperture 
would be reduced only slightly, and a somewhat smaller solar 
elongation would be allowed. 

Another method would add some short vertical baffles 
arranged along radial lines at the inner points of the bottoms 
of the corner tubes on the outer ring. The entering collimated 
beam from the deep-space probe and the focused beam to the 
secondary reflector would be obstructed by these vertical 
radial baffles only to the extent of their finite thickness. How- 
ever, they could be made long enough to come close to the 
surface of the primary reflector (to within a suitable clearance), 
and could intercept the skew rays that had previously limited 
the minimum solar elongation. The minimum solar elongation 
might therefore be reduced to a new limit imposed by skew 
rays in another set of tubes, either the nearest neighbors of the 
corner tubes on the outermost ring, or the tubes that form the 
next-to-innermost ring. 

F. Areas Requiring Further Study 

1. Use of radial baffles within the focused beam region 
between reflectors. Small radial extensions of the baffles on 
the outermost-ring corner tubes have been mentioned already. 

Inspection of Fig. 5 shows that some of the segmentation 
lines are radial, on the odd-numbered rings beginning with the 
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innermost. Since the integral sunshade forms a very strong 
structure, and radial plates introduce an additional geometrical 
obstruction proportional only to their thickness, a designer 
might consider extending the radial walls of the tubes down- 
wards to tie together the integral sunshade and the primary 
reflector support truss. Very little additional sunshading would 
be provided, but a more rigid overall structure would be ob- 
tained. Thermal analysis would have to show that the advan- 
tage in rigidity would not be offset by the thermal distortions 
caused by nonuniform heating of the sunshade. 

2. Thermal problems to be overcome. Absorption heating 
will occur on one side only of the tubes, with the depth of 
penetration dependent on the location of the tubes relative 
to the Sun. Due allowance must be made for thermal distortion 
of the structure, and possible dislocation of the secondary 
reflector. 

a. Ground-based antennas. Convection currents within the 
tubes will arise if a large temperature difference exists between 
opposite walls. These currents produce fluctuations in the 
density and refractive index of the air, and would blur the 
image of the deep-space laser source at the focal point of the 
system. The threshold for the onset of convection, i.e., the 
ratio of the temperature difference between opposite walls to 
the absolute temperature, is inversely proportional to the cube 
of the distance between the walls [3] . The walls must there- 
fore be highly thermally conducting, in order to reduce the 
temperature difference between opposite walls within a tube 
as much as possible. The central tube can be left open at the 
top, with a thermal shield over the secondary reflector at the 
bottom, in order to ensure uniform heating of its walls. This 
also suggests that the outer surfaces of the sunshade should 
be blackened, contrary to the normal practice of making the 
primary sunshade white on the outside. 

The tubes on the side closest to the Sun will be penetrated 
to a greater depth, and through a larger projected aperture, 
than the tubes on the side farthest from the Sun. When the 
deep-space probe is seen above the Sun this situation can lead 
to gravity-driven circulation of air that will enter the lower 
tubes, pass between the reflectors, and exit through the upper 
tubes. As long as the flow is laminar the blurring of the image 
will be minimal. However, the dynamics of this process require 
study. 


Forced outward convection of filtered air through all the 
tubes may be capable of providing necessary cooling, prevent- 
ing unstable or turbulent convection, and keeping the optics 
clean. 

b. Space-based antennas . Lack of convection will eliminate 
blurring of the optical image but may require introduction of 
heat pipes or other means of heat extraction. 

3. Use of the sunshade instead of a dome. Microwave radio 
antennas are often used without domes. It may be possible to 
provide tube caps and weatherization that would eliminate the 
need for a dome over the optical reception antenna, at a sub- 
stantial savings in cost. 

4. Mass reduction and deployability of a space-based inte- 
gral sunshade. Deployment of a low-mass integral sunshade in 
space represents a solvable construction challenge, especially if 
the integral sunshade is used as proposed to support the sec- 
ondary reflector in relation to the primary reflector instead of 
a support spider. 

IV. Conclusions and Recommendations 

A novel kind of sunshade has been proposed for large- 
aperture hexagonally segmented optical reception antennas, to 
permit optical communication even when the deep-space laser 
source is as close to the Sun as 12 degrees. Inserts in the tubes 
of the sunshade would permit operations at solar elongations 
as small as 6 or 3 degrees, at a slight reduction in effective 
aperture area and a small increase in diffraction spreading of 
the source image. 

The compactness of the sunshade effects a substantial cost 
savings when the optical reception antenna is ground-based 
and housed under a dome. The inner diameter of the dome can 
be almost as small as six-fifths of the aperture diameter. A 
space-based optical reception antenna would use much less 
energy to orient this sunshade than it would orienting a con- 
ventional sunshade of comparable functionality, and the mass 
and added diffraction of a secondary-reflector support spider 
are eliminated. 

A few design issues remain for investigation, such as the 
thermal distortion, convection currents with ground-based 
antennas, and heat extraction for space-based antennas. 
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Table 1. Recurrence and duration of solar conjunctions 


Elongation, deg 


Planet 

7\ sec 

V 

yr, d 

12 

6 

3 

1 

/, deg 





r d> d 
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T d- d 
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Mars 

59355300 

2 

49 

86 

43 

22 

7 

1.850 

Jupiter 

374320000 

1 

34 

32 

16 

8 

3 

1.309 

Saturn 

929604000 

1 

13 

28 

14 

7 

2 

2.493 

Uranus 

2651140000 

1 

4 

26 

13 

6 

2 

0.773 

Neptune 

5200270000 

1 

2 

25 

13 

6 

2 

1.779 

Pluto 

7837350000 

1 

1 

25 

13 

6 

2 

17.146 
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Fig. 1. Configuration of the Earth ( ®) and an outer planet P when 
approaching (solid lines) and leaving (broken lines) solar con- 
junction within solar elongation E. 






Fig. 3. Overview of a glue-and paper model of an integral sun- 
shade. The telescope looks through the sunshade from behind and 
below it, in this view. The hexagonal tubes are trimmed to a 
spherical shape. 


Fig. 2. Logarithm of dome cost versus dome diameter. Data sup- 
plied by manufacturers: A = Ash-Dome, C = Coast Steel, O = 
Observa-Dome Laboratories. 
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Fig. 4. Underside view of an integral sunshade model. The hex- 
agonal tubes are trimmed to form a six-sided pyramid, with the 
secondary reflector to be mounted at the apex and the primary 
reflector at the base. 


y 
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Fig. 5. Plan view of a hexagonally segmented reflector. 



1,1 = INNERMOST RING TUBE 




Fig. 6. Cross-sectional views of an integral sunshade on a Cassegrain optical reception antenna 

within a swing sphere. 



10-m APERTURE 



Fig. 9. Cross sections of (a) asterisk vanes within a haxagonal tube 
and (b) six-point-star vanes superimposed over asterisk vanes. 



Appendix A 

Duration and Recurrence of Outer-Planet Conjunctions 


The orbits of the Earth and an outer planet P of the solar 
system are shown in Fig. 1 as viewed from above the north 
pole of the Sun. The respective periods of revolution t and T 
are related to the respective orbital semimajor diameters r and 
R by Kepler’s law, r 3 /t 2 =R 3 /T 2 ) which shows that the outer 
planet moves at a slower angular rate 2tiIT than the Earth’s 
angular rate 2ir/t. The zero of angular measurement is desig- 
nated as the Earth’s position at the time that the outer planet 
is seen from Earth at an elongation angle E east of the Sun, 
when the planet is approaching conjunction. The analysis will 
be simplified by approximating the orbits with circles lying in 
the same plane and concentric on the Sun. The distance be- 
tween the Earth and the outer planet is p, and the angular 
position of the outer planet is © 0 . Trigonometric relation- 
ships for the solid-line Earth-Sun-outer-planet triangle yield 


The orbital position of the Earth at any time r is 6 = 2i rr/f, 
and that of the outer planet is © = 0 O + 27 tt/T. At the time 
r d when the outer planet has passed conjunction and is seen at 
an angle E west of the Sun (i.e., at the end of the time that the 
outer planet is seen within an elongation E of the Sun), the 
Earth has reached the position Q d , the outer planet has reached 
the position © d , and the configuration is represented by the 
broken-line triangle. The solid- and broken-line triangles are 
congruent since the radii and the elongation angles are equal, 
so the two obtuse angles are equal, 


The solution for the duration is 



sin(27r - © 0 ) sinE 


R 


r 2 + R 2 - 2rR cos( 27 t - © ) 


Squaring the first relationship, substituting for p 2 from the 
second, and replacing sin 2 (27r - © 0 ) with 1 - cos 2 (27r - © 0 ) 
leads to a quadratic equation whose solution is 


cos 1 

sin 2 E ± cos E 

1 1 - — S in 2 £ 


r \ 

1 

n 


The lower sign corresponds to the solution sought. The upper 
sign leads to a solution that becomes degenerate if one con- 
siders conjunction of a planet in the same orbit as the Earth, 
so R - r. (The upper-sign solution would correspond to a posi- 
tion coincident with the Earth, and the ratio (sin E)/R would 
be equal to the ambiguous form 0/0.) The ratio r/R may be 
replaced with ( tjT ) 2 / 3 in order to work only with the plane- 
tary revolution periods. 



The next occurrence of an epoch of conjunction begins at 
r r , the first time the difference between the orbital positions 
(© - 6) modulo 2 7 r is again equal to the original difference © 0 . 
The outer planet moves more slowly so the difference becomes 
negative and 2 t r will have to be added. This gives 

2m 2m 

0 o + — + 2?r = 0 o 

1 
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Appendix B 
Design Equations 


The integral sunshade for an optical reception antenna was 
designed on the basis of a parameterized model of the antenna. 
The antenna is assumed to consist of a parabolic primary re- 
flector and a secondary reflector in the form of part of the 
upper sheet of a hyperboloid of revolution. The primary re- 
flector is hexagonally segmented and rests in the bottom of 
the swing sphere. The secondary reflector is of the same size 
as the absent central hexagon of the primary reflector. The 
secondary reflector is positioned where it will be filled by the 
focused beam from the primary reflector, and the secondary 
focus is centered on the projected surface of the primary 
reflector. 

The following definitions, parameter values, and equations 
were used. 

Origin: Center of primary reflector rim circle 
Positive z axis: Along the line of sight 
Positive jc axis: Through the center of a corner hexagon 
R = 6 m = radius of telescope swing sphere 
D = 10 m = aperture diameter 
n = 4 = number of rings 

z c ~ \/R 2 - (D 2 / 4) = 3.317 m; center of swing sphere is 
at (0,0, z c ) 

= polar angle for swing sphere 

x - Rsin(\p) = x -coordinate of swing sphere 

z ~ Rcos(\p) + z c = z-coordinate of swing sphere 

w = D/(2n + 1) = 1.111 m = width across flats of a hex- 
agonal segment 

s = w/V 3 = 0.642 m = side length of hexagonal segment 
/ = 0.5 = /number or focal ratio of primary reflector 

Paraboloidal primary reflector surface equation 
_ (x 2 +y 2 ) D 

z 4 Jd ~W 

Dj(\6f) = 1.250 m = reflector concavity 

fD = F = 5.000 m = distance from reflector center 
to primary focus 


z Fp = fD - [Z)/(16/)j = 3.750 m; primary focus is 
located at (0,0, z Fp ) 

h = 0.833 m = axial step height between tube edges 
at reflector ends 

/D - [Dl(l6f)\ 

h = 2 — 2^n — 

f fDw = (/D/2) -/w+ [w/(l 6f)\ - 2.083 m = distance in- 
volved in determining the secondary reflector 

fs = ( pD 2 /4 ) + (w 2 /4) + fj Dw = 10.899 m 2 = area 
factor involved in determining the secondary 
reflector 

a 2 = (f s ~ //* -/ 2 D 2 // Dw )/2 = 3.846 m 2 = hyper- 
boloidal secondary major parameter squared 

b 2 = c 2 - a 2 = (/ 2 D 2 /4) - a 2 = 2.404 m 2 = hyperbo- 
loidal secondary minor parameter squared 

z h = (/D/2) - [D/(16/)j = 1.250 m = center of hyper- 
fa oloidal secondary 

y om ip ~ -[(3w/2)+l]s = -4.490 m = outer point of outer 
midlateral hexagon on primary 

z omip = -P/06 /)] + [yl m , P l(4fD)\ = 0.242 m = outer 
point of outer midlateral hexagon on primary 

z = z h + \Ja 2 + (x 2 + y 2 )(a 2 /b 2 ) = equation of hy- 
perbolic secondary reflector 

z beO ~ nh = m = height at bottom of central tube 

edge 

A coordinate system was established in the plane of the 
primary reflector to make the positions of most features of the 
hexagonal grid equal to an integral number of units. In some 
cases, however, half-units had to be counted. 

u = w/2 = 0.555 m = 1 unit in the ^-direction 

t = s/2 = 0.321 m = 1 unit in the j-direction 

The cutting of the lower ends of the tubes to clear the 
focused beam from the primary reflector forms a kind of 
“ceiling” over the primary. This ceiling is in the form of six 
planes, convergent at the primary focus, forming a six-sided 
pyramid. The base of the pyramid was chosen to rest on the 
outer points of the outer-ring point hexagons. Each plane was 
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then determined by the primary focal point and two other 
base points, of which the following two are typical: 

x = 9 (in units of u), y cl = 1 (in units of f), z cl = 0 m 
x c2 = 5 (in units of u ), j> c2 = 13 (in units of t ), r c2 = 0m 

The minimum elongation angle E is the smallest angle that 
allows a ray to penetrate the sunshade to the primary reflector 
surface. This ray is one that crosses from a low point on the 
top of one hexagonal tube to a high opposite point on the 


tube at the bottom, in a short tube, provided that afterwards 
the ray is incident on the primary reflector. The angle is cal- 
culated by giving the coordinates of the two opposite points of 
a tube, as follows, and trying various tubes until the largest 
angle E is found. In the following, subscript t refers to the top 
point, and b to the bottom point. 

x f = 9 (in units of u), y f - -1 (in units of t) 

z f = z c + y/R 2 -x]u 2 - y]t 2 = coordinate (on sphere) 

x b = 7 (in units of u), y b = 1 (in units of t) 


z b = coordinate (focused beam clearance) 


Z Fp 


- Z cl^ C 2 -( Z Fp - Z c2^cJ + V*cl(*Fp - Z cJ- X cl( Z Fp ‘ Z cl » 

x el y e2 -x ei y el 


E = minimum elongation angle 

ISO , y(*r-**) a “ 2+ 0,-^) a ' 2 

= — tan~* 

Each unique off-axis tube stands over an area completely 
within a sector defined by the positive x-axis and a radial line 
at 60 degrees from it. The axial tubes are split by the x-axis, 
with their positive halves standing over the sector just defined. 
The points of the tubes were lettered A, B , C,£>,£, andF, in 
order counterclockwise starting with the point nearest the 
60-degree sector line. The top and bottom coordinates for cut- 
ting the tubes were then found as the intersection of the verti- 
cal lines of the tubes (the projections of the hexagon points) 
with the “ceiling” or with the swing sphere. 

i = number of the ring (center tube is 0) 

/ = number of hexagon in ring, from 1 on x-axis up to i 

! 1 if i = 0 (l if i = 0 

2 1 - / otherwise I 3/ - 4 otherwise 


! 1 if z = 0 

2z-/+l otherwise’ 


il 


if z = 0 


= 


if i = 0 


1 3/ - 5 otherwise 
[l if i = 0 


c j 2i -j + 2 otherwise ’ j 3/ - 4 otherwise 


! 1 if i = 0 _(l 

2 i -j + 2 otherwise ’ ^ D )3/ • 


if i = 0 
2 otherwise 


if * = 0 


if i = 0 


E I2/-/+1 otherwise’^ i 3/ — 1 otherwise 


if i = 0 


if i = 0 


F 1 2/ — / otherwise’^ )3/-2 otherwise 


Let X stand for one of the letters A through F. Then 


z top T 2 


Z = 7 _ 

bottom Fp 


X x K Z Fp- Z dK2 cJ + l^'Kl( 2 Fp- Z c2)- X c2(V p - Z cl)l 

V c2 -Wci 
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A six-sided frustum of a pyramid, placed around the aper- 
ture in the primary reflector, prevents the entrance of any re- 
scattered stray light from the primary reflector into the de- 
tector. It also limits the view from the secondary focal point 
to the secondary reflector. The larger base of the frustum is 
the same size as a hexagonal segment. The height above the 
primary reflector rim plane is determined by the intersection 
of a ray from the secondary focal point to a midlateral point 
of the secondary reflector and a ray from the primary focal 
point to a midlateral point of the central hexagon on the pri- 
mary reflector. The calculations involve a pyramid factor 
f pyr = FI[Zte o + D/(16 f) + F] = 0.522. The height above the 
primary reflector mirror rim plane is z Fp - Ff pyr = 1.145 m. 
At the top of the pyramid, the distance to the edge from the 
center is wf pyr /2 = 0.290 m, and the distance to the point 
from the center is - sf pyr = 0.335 m. 

A hexagonal cone (or a hexagonal cylinder) from the sec- 
ondary focal point around the beam from the secondary 
reflector may be added to capture rays that are rescattered 
within the structure, or that enter through the inner ring tubes 
at less than the minimum elongation but would strike the pri- 


mary reflector surface within the central hexagon. The height 
and spreading of the cone are determined by the intersection 
of the ray from the secondary focal point to a midlateral point 
of the secondary reflector and a ray passing through the inner- 
most ring tube from the top outer midlateral point to the 
bottom inner midlateral point. This gives the distance from the 
center to the top edge of the cone as 


-3z fce0 - [20/(16/)] 
w — — = 0.127 m 

2 Vi + 2WP/(4/)] 


and the height (relative to the primary reflector rim plane) as 


te 1 


V>mfA 


3/2 


^ te\ “ Z heft) 


be 0 ' 


= -0.199 m 
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Optical reception antennas used at a small Sun-Earth-probe angle (small solar elonga- 
tion E) require sunshading to prevent intolerable scattering of light from the surface of 
the primary mirror. An integral sunshade consisting of hexagonal tubes aligned with the 
segmentation of a large mirror has been proposed for use down to E = 12 degrees. For 
smaller angles , asterisk-shaped vanes inserted into the length of the hexagonal tubes 
would allow operation down to about 6 degrees with a fixed obscuration of 3.6 percent. 
Here we investigate two alternative methods of extending the usefulness of the integral 
sunshade to smaller angles by adding either variable-area shutters to block the tube cor- 
ners that admit off-axis sunlight or by inserting slats (partial vanes) down the full length 
of some tubes. Slats are effective for most operations down to 6 degrees ; and obscure 
only 1.2 percent. For E between 1 0. 75 and 12 degrees , shutters cause even less obscuration. 


| 
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I. Introduction 

Deep-space-to-earth optical communication will require the 
development of a large-aperture ground-based reception an- 
tenna. Such an antenna, SORDA, is described in [1] . To re- 
ceive data from a deep-space probe during the daylight hours, 
it is essential to shade the antenna primary mirror from all 
sunlight. Various sunshade designs have been considered. The 
best so far, the integral sunshade, is described in [2] . The inte- 
gral sunshade segments the aperture with a bundle of long 
hexagonal tubes. During solar conjunction, when the space 
probe appears to be close to the Sun, the integral sunshade 
blocks sunlight incidence on the primary mirror as long as the 
angle E seen from the Earth between the Sun and the probe (the 
SEP angle or the solar elongation) is greater than 12 degrees. 

When the solar elongation is less than 12 degrees, the inte- 
gral sunshade would admit “chinks” or oddly-shaped patches 
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of sunlight on the primary mirror, as in Fig. 1(a). The chinks 
would grow as the solar elongation was reduced; see Fig. 1(b). 
In [2] it was proposed that vanes (six flat plates arranged in 
cross section like an asterisk) be inserted in the hexagonal 
tubes of the sunshade to permit operation within 6 degrees of 
the Sun. The finite thickness of the vanes would obstruct the 
antenna aperture (that is, reduce its effective collecting area). 
If the width across the flat sides of the hexagonal tubes is 
1.11 m and the effective vane thickness is 1 cm, the area re- 
duction is 3.6 percent. 

Herein, we compare two modified approaches to the sun- 
shading problem. Since the chinks begin from sunlight leakage 
through the extreme corners of the hexagonal tubes, the cor- 
ners may be blocked with appropriately shaped partial shut- 
ters, whose area can be increased as the solar elongation is 
reduced. These shutters would also obstruct the aperture, but 
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at first not as much as the vanes would. The distribution of 
sizes and the additional obstruction caused by the partial 
shutters are analyzed in this article. Alternatively, one may 
insert a slat down the length of each tube as the decreasing 
solar elongation exposes it to sunlight penetration. We will 
compare the effectiveness of the shutters to the slats. 


II. Review of the Integral Sunshade 

Sunshades used with small antennas are usually an exten- 
sion of the primary lightshade between the objective and eye- 
piece, or between the secondary and primary mirrors. The 
closer the antenna is pointed toward the Sun, the longer the 
sunshade must be. End-mounted sunshades become impracti- 
cal for large antennas. For instance, at£= 12 degrees, the sun- 
shade length must be five times the aperture diameter. In one 
previous concept, the sunshade was to be mounted on the 
exterior of the antenna dome. Later it was observed that, by 
segmenting the sunshade in a hexagonal pattern similar to that 
proposed for the primary mirror, the sunshade may be short- 
ened enough to mount on a fast-primary antenna and fit al- 
most within the spherical volume swept out by the motion of 
the primary mirror itself. This configuration saves cost by 
minimizing the dome enlargement required to accommodate 
the sunshade. It eliminates the problem of having to remove 
and secure the sunshade during times of high winds. The inte- 
gral sunshade can also provide a strong, rigid support structure 
for the secondary reflector, instead of the usual spider. 

The hexagonal tubes extend at one end as close as possible 
to the envelope of the focused beam from the primary mirror, 
and to the reception station dome at the other end. If employed 
with a segmented primary mirror, the tubes would be aligned 
with the segmentation lines to minimize obscuration. The 
integral sunshade is short enough to fit within a dome whose 
diameter is 6/5 the diameter of the primary mirror. 

The current design calls for a unit composed of sixty-one 
tubes. The width across the flats of each tube is 1.1 1 meters. 
The integral sunshade has the appearance of a honeycomb, 
concave in the shape of a pyramid on the bottom, and convex 
in the shape of a sphere on the top. The sunshade and antenna 
are supported at elevation pivot points on opposite sides of the 
sunshade by a yoke whose base may be turned in azimuth. 
Within each tube, along the sides, are ring baffles to intercept 
unabsorbed glare light reflected from the side of the tube at 
grazing incidence, and to redirect it back out the top of the 
tube. The ring baffles make the effective wall thickness of 
the integral sunshade about equal to 1 centimeter. The in- 
tegral sunshade excludes sunlight from the primary mirror ex- 
cept when the solar elongation is less than 12 degrees. 


III. Review of Solar Conjunctions 

Most space probes, particularly those on missions to the 
planets, appear to approach and recede from the Sun because 
of the orbiting of the Earth and of the space probe. The times 
when this happens are called epochs of solar conjunction. The 
chief reason for tracking a probe near the Sun is that the 
probe may be near a planet that has periodic solar conjunc- 
tions. An outer planet will appear to approach the Sun from 
the east, pass close to it above or below or perhaps exactly 
behind it, and then recede to the west, over a period of several 
days. The path above, behind, or below the Sun depends on 
the inclination and orientation of the planet orbit relative to 
the Earth orbit. During the days of closest approach, or when 
the planet moves directly behind the Sun, there will be a 
communications outage. The duration of the outage may be 
reduced by reducing the minimum solar elongation at which 
optical communication is possible. 

IV. Design Complications 

Fitting the upper part of the sunshade to the swing sphere 
leads to some complications, which have been turned into 
opportunities for design economies. Because of the variations 
in the lengths of the tubes, the shading characteristics are 
not uniform. In general, the tubes nearest the Sun will begin 
to admit light at small elongation angles before the tubes 
farther away. This means that small shutters can be added on 
the corners of some tubes while others are left open, with 
little overall obstruction of the collecting area. 

The exact shape and placement of the shutters, and the 
sizing of each one, depend not only on the solar elongation, 
but also on the direction of the line seen in the sky from the 
probe to the Sun. The line will appear to rotate slowly rela- 
tive to the segmentation pattern as the antenna moves in 
azimuth and elevation to track the probe over a maximum of 
about ten hours of observing, from some minimum elevation 
angle at rising to the same angle at setting. The shutters would 
also have to move and change in size appropriately, or else 
be made a little larger than the absolute minimum necessary. 
The geometry of this problem is very complicated and still 
under study. Further investigation will also determine the 
exact conditions under which a slat will be effective in block- 
ing solar penetration. 

Operations and usage will affect the detailed design and 
implementation of the shutters. During any given day, the 
shutters may remain about the same size, but their shape and 
placement depend on the path taken by the planet during 
conjunction. The number of slats required would also be 
constant for a day, but some of them might have to be removed 
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and turned during a day (or else two slats in the form of an 
“X” would be required.) After the time of closest approach 
during the conjunction, the shutters or slats must all be 
switched to the other side of the antenna. The switching could 
be effected most easily by making the range of the elevation 
axis a full 180 degrees, and by rotating 180 degrees in azimuth 
also. However, the difficulties of mounting the mirror seg- 
ments to maintain their alignment during a reversal of the 
gravity vector would probably limit the elevation range to 
about 1 10 degrees. 

V. Methods of Reducing the Minimum Solar 
Elongation for Optical Reception 

The first alternative would be to block a corner or side of 
each vulnerable tube at the outer end. A study of this ap- 
proach has been undertaken. In the study, the integral sun- 
shade structural plates were taken to be infinitesimally thin. 
The 1 -centimeter thickness added by the ring baffles would 
allow operation about 0.5 degree closer to the Sun. Thus far, 
only the worst case has been analyzed. The Sun will penetrate 
the shade most easily if the Sun is shining at an angle across 
opposite vertices of the hexagons. It will begin doing so when 
E is less than 12 degrees. (In the best case, when the Sun is 
shining perpendicularly to opposite edges of the hexagons, 
sunlight penetration does not begin until E becomes less than 
about 10.5 degrees.) Since different tubes would require 
different amounts of extra shading, the size of the added 
shade in each of the tubes would vary. Figure 2 shows the 
sizes and shapes of the shutters in the case where E = 9 degrees 
with an attendant loss of 3.6 percent of the collection area. 
The case where £ = 6.5 degrees is illustrated in Fig. 3. Here 
the loss of signal would be 26.7 percent. The amount of the 
remaining collecting area has also been calculated and graphed 
in Fig. 4. The collecting area drops off to 73.3 percent at an 
elongation angle of 7.049 degrees. (Elongation angles were 
calculated and shown in Fig. 4 for the infinitesimally thin- 
walled sunshade. Elsewhere in this report they have been esti- 
mated and reported to be about 0.5 degrees smaller because 
the ring baffles will be used.) Further study is necessary to 
analyze the problem of shading as the angle of the Sun in 
relation to the target changes. It appears, however, that only 
a small fraction of additional obscuration is required to permit 


uninterrupted observation for as long as ten hours, while the 
Sun angles change from early morning to late afternoon. 

A second alternative calls for inserting a slat into each tube 
as shading is required. This will cause a 1.2-percent loss of 
signal to the corresponding mirror segment. As the solar 
elongation decreases from 12 degrees, the total receiver area 
loss for the telescope rises from 0 to 1.2 percent. This loss is 
much more acceptable than the larger losses mentioned earlier 
for shutters. However, slats are much larger than shutters and 
potentially more difficult to install or implement for auto- 
matic placement. They require reorientation (just as the shut- 
ters do) as the Sun angle changes. 

As illustrated in Fig. 5, there is a range of angles from 10.75 
to 12 degrees for which the shutters give slightly less obscura- 
tion than the slats. As the elongation diminishes from 10.75 
degrees, the performance of the slats relative to the shutters 
increases dramatically. 

VI. Conclusion 

As currently conceived, without any additional shading, the 
integral sunshade will block unwanted solar interference for 
any solar elongation down to 12 degrees. By attaching variable- 
area partial shutters at the ends of some of the tubes, it will be 
possible to continue to receive optical communication from a 
space probe with a loss in signal power varying from 0 to 3.6 
percent as the elongation is reduced from 12 degrees to 9 de- 
grees. Slats inserted across the corners and along the length of 
the hexagonal tubes would cause an overall signal loss vary- 
ing from 0 to 1 .2 percent as the solar elongation varies from 
12 degrees to 8 degrees. The signal loss with shutters is slightly 
less than the loss with slats for elongations from 12 degrees 
down to 10.5 degrees; for smaller angles the loss is decidedly 
greater. If the optical communications system performance 
is such that a greater signal loss can be accepted, it may be 
more convenient to use shutters instead of vanes even at 
elongations as small as 6.5 degrees. The shutters should be 
much easier to fabricate than the slats, and it should be much 
easier to actuate the shutters than to install the slats when 
needed. Both shutters and slats should be considered during 
the design of the optical reception antenna. 
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Fig. 2. Partial shutters as viewed along the length of the integral 
sunshade tubes, when the projected Sun vector (arrow) is directed 
parallel to the line joining the hexagon corners from lower left to 
upper right. This figure shows shading necessary for E = 9 degrees. 


Fig. 1. A model of the integral sunshade. The primary mirror is to 
be mounted on the back, facing forward, (a) Light incident at this 
small angle relative to the sunshade axis is admitted by the tubes on 
the upper right that show patches of the background, (b) At this 
smaller angle more tubes show the background and admit more 
light. 
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1.2 



Fig. 3. Partial shutters when sunlight is incident along the same 
line as in Fig. 2. Here we see the shading necessary for E = 6.5 
degrees, an extreme case in which some shutters block a third of the 
tube area. 



Fig. 4. Collecting area reduction as a function of solar elongation 
when minimal partial shutters are used with the worst sunlight 
incidence direction (when the sunlight vector as projected to the rim 
plane of the primary mirror is parallel to a line joining opposite 
vertices of a hexagon). Numerical values are based on the infini- 
tesimally thin-walled integral sunshade model. 



Fig. 5. Comparison of collecting area reduction for shutters 
and slats. 
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The effect of Earth albedo variation on the pointing and tracking subsystem of a 
planetary optical communication package is analyzed . By studying the Cramer-Rao bound 
of the tracking error variance , it is shown that , when the Earth albedo is precisely known , 
the variance in spatial tracking error is inversely proportional to the total signal count. In 
contrast , a small uncertainty in the Earth albedo can result in an irreducible error in the 
tracking subsystem. 


I. Introduction 

Accurate spatial acquisition and tracking are critical for the 
operation of free-space optical communication systems. In 
order to maintain the signal power loss to within an acceptable 
level, tracking accuracy on the order of 1/10 to 1/20 of the 
transmitted beamwidth is generally required. For a system 
operating at 0.5 -/urn wavelength using a 30-cm aperture, the 
desired pointing accuracy will be on the order of 0.1 jurad. 
Since the angular resolution of the optical system is roughly 
equal to its transmitted beamwidth, the narrow pointing re- 
quirement implies that the spatial acquisition subsystem must 
derive a pointing reference to within 10 percent of the mini- 
mal spatial resolution. Because both the transmitter and the 
receiver are typically in motion, the tracking information must 
be derived within a time period during which the receiver may 
move a significant distance within the transmitter field-of-view. 
In some systems, the residual vibration due to the mechanical 
system will be much larger than the desired pointing accuracy. 
For these systems, the tracking information must be obtained 


at a rate that is higher than the vibration frequency so that the 
optical system can effectively compensate for vibration- 
induced error. 

The performance of spatial tracking algorithms has been 
investigated by several authors [1] , [2] . Most of these studies, 
however, were carried out with the assumption that a beacon 
signal is available as a pointing reference. These studies gener- 
ally suggest that the effective pointing error decreases with 
increasing beacon strength or, effectively, with increasing inte- 
gration time. In the latter case it is assumed that the beacon 
strength remains constant so that increasing the integration 
time, i.e., decreasing the tracking loop bandwidth, will result 
in an effective increase of the detector signal-to-noise ratio 
(SNR). In some cases, however, an active pointing beacon can 
be either undesirable or unfeasible. In these cases it may be 
desirable for the transmitter to use the a priori knowledge of 
the receiver to derive the pointing information. For instance, a 
spacecraft may use the sun-lit Earth as a pointing reference 
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and derive the actual receiver location using simple geometric 
rules. This scheme can be particularly attractive for a deep- 
space optical communication system for which the uplink bea- 
con may require several kilowatts of radiated power. 

* 

The problem of deriving a pointing reference is equivalent 
to locating the image of the object on the receiver focal plane. 
For an extended object that can be resolved by the telescope, 
the optimal maximum-likelihood tracking algorithm has been 
derived, and its performance has been extensively investigated. 
The results generally state that, given the known source inten- 
sity distribution, the variance of tracking error will be inversely 
proportional to the detector SNR. Unfortunately, for most 
applications, the source brightness distribution is not precisely 
known. Consequently, there will be an error associated with 
the spatial tracking subsystem. The purpose of this report is to 
analyze the effect of uncertainties in source brightness on the 
spatial tracking subsystem. 

The rest of this paper is organized as follows. Section II 
describes the conventional maximum-likelihood tracking algo- 
rithm for determining the pointing reference. The effect of 
uncertainties in source brightness will be analyzed in Section 
III by studying the Cramer-Rao bound on the tracking error 
variance. A typical planetary optical communication package 
using the sun-lit Earth as a pointing reference will then be 
described and the impact of Earth albedo variation on the 
tracking error will be studied in Section IV. Finally, the results 
from the study will be summarized in Section V. 

II. Maximum Likelihood Spatial Tracking 
Algorithm 

In this section the Maximum Likelihood (ML) algorithm for 
determining the angular position of the receiver is derived. It 
is assumed that the shape and orientation of the receiver is 
known. The problem of estimating the angular coordinate of 
the receiver is equivalent to estimating the location of the re- 
ceiver image on the receiver focal plane. Without loss of gener- 
ality, this is equivalent to locating a fixed reference point on 
the image. For simplicity, the reference point is chosen to be 
the geometric center r of the image. Consequently, the prob- 
lem of spatial tracking can be reduced to the problem of esti- 
mating the geometric center given the detector photocount 
statistics and the prior knowledge of the source brightness 
distribution, / Q (p), where p is the distance to the image center. 


granularity of the optical signal, the output of the (ij) th pixel 
k.j will be a Poisson- distributed random variable with a mean 
X,y (/*) where 

\,(r) = I I 0 (P-r)d 2 p (1) 

JAij 

In writing Eq. (1), I Q (p - r ) denotes the brightness distribution 
of the image where the geometric center is displaced by an 
amount r, and the integral is over the surface of the (i 9 j) th 
pixel. 

The problem of deriving the transmitter pointing reference 
is therefore reduced to the problem of estimating the deviation 
r from a set of detector photocounts, { Ar /y -} . The maximum 
a posteriori (MAP) estimator [3] of the deviation IAP is given 
by 

'map = ar § | m f- i p ( r \{ V)}j ( 2 ) 

If it is assumed that the prior probability distribution of r Q is 
uniformly distributed, then the decision rule can be reduced to 
the following maximum-likelihood (ML) rule: 



where it is assumed that k { - are independent and Poisson- 
distributed with parameter and observed that the log- 

arithmic transformation does not change the maximum of a 
function. The ML decision rule can be further simplified by 
realizing that 

N N 
/=! /-I 

is proportional to the total power received by the receiver, and 
is therefore independent of the location of the geometric cen- 
ter r. Consequently, the ML decision rule can be written as 


The problem of spatial sstimating is complicated by the fact 
that the receiver has a finite spatial resolution. This is because 
the tracking detector, which is typically a charge-coupled de- 
vice (CCD), has discrete spatial cells (pixels) that occupy a 
finite area. Each pixel output represents the total intensity of 
light impinged on the pixel area. Furthermore, because of the 


! jV N 

EE log \y(r) j 

/= 1 /-I 


(4) 


The performance of the estimator can be estimated by using 
the Cramer-Rao lower bound (CRLB) [1] 
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where 


Var(|r-r 0 |)> 
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E(r) = 2 k tj log \ tj (r) 

a 

and E[x] denotes the expectation value (over {k^}) of the 
variable x. By using the assumption that ky are independent 
and Poisson-distributed, the lower bound can be further re- 
duced to 


Var(|r-r 0 l) > 



(d\.(r)/dy) 2 + (d\..(r)/dx) 2 

V r) 


\r^(d\j(r)/dy) 2 ' 


\^(H jj (r)ldx) 2 


V 4 (3X v (r)/3jc><dX w (r)/a^)"l * 

V ^ j 


Lv J 


L v 


(6) 


Given the source intensity distribution, / 0 (p), the variance of 
tracking error can be calculated. It should be noted that the 
CRLB is a lower bound on the estimator error. However, it 
provides an analytically tractable expression, and is therefore 
very useful in estimating the performance of the tracking 
system. 

Note that the variance in Eq. (6) is in general a function of 
r 0 , the actual geometric center. Furthermore, the variance 
depends on both the shape of the image as well as its intensity. 
To study the effect of increasing source intensity, or equiva- 
lently, increasing the integration time, one can normalize the 
receiver count parameter {\y(V)} as 


tyo = W r) (7) 

where is the average number of photons received over the 
tracking sensor, and gy(r) is the fraction of photons that falls 
onto the ( ij) th pixel. By definition, 

Efyfr) = 1 

a 

Given the above definition, the CRLB can be written as 


Var(|f ■ 


r o'> a TT 
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(dg^/dy) 2 + ('dg..(r)lbx) 2 




O ($g ..(r) /dx) 2 


y (dg ij {r)ldx)(dg i .(r)ldy) 
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[v J 



( 8 ) 


It is easily seen that the performance of the tracking sys- 
tem improves with increasing signal power, N Q . The func- 
tion G(r 0 ) in Eq. (8) depends only on the shape of the image, 
and not its intensity. Figure 1 shows the values of G(r Q ) 


for two very simple cases. For a more general image shape, 
G(r 0 ) is very difficult to calculate. However, it can be seen 
from Eq. (8) that the lower bound in error variance is mini- 
mized when 
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and 



The expressions in Eqs. (9a) and (9b) are greater for images 
with high contrasts, i.e., images that contain pixels with high 
\bg i j{r)/bx\ and \bg^{r)/by\. Since the reference image is 
usually much brighter than the background, the partial deriva- 
tives are greater near the image border. It follows that the 
CRLB is smaller for images with better defined borders, i.e., 
images for which bg^/dx and bg^/by are large. 

III. Albedo Variation 

The derivation above shows that, when the source intensity 
distribution is known, the variance in estimating the image 
location decreases linearly with increasing signal power. For a 
sufficiently bright source, the variance is negligible. 

Unfortunately, the derivation that leads to Eq. (8) assumes 
that the exact source intensity distribution I Q (r) is known at 
the receiver. In some cases, the intensity distribution of the 
receiver can be quite unpredictable. For example, for a deep- 
space vehicle using the sun-lit Earth as the pointing reference, 
the albedo variation of the Earth can be caused by weather 
patterns and changing surface conditions. Furthermore, these 
conditions are in general time-varying so that they cannot be 
expected to remain constant. 

In order to quantify the effect of intensity uncertainty on 
the spatial tracking subsystem, some assumptions on the inten- 
sity error distribution are required. For this analysis, it is 
assumed that the estimated source intensity distribution /(p) 
differs from the actual source intensity / 0 (p) by a small 
amount 7j(p). Furthermore, it will be assumed that 7 x (p) is a 
zero -mean Gaussian random process with an autocorrelation 
function 


0(p,p') = </ 1 (p)/ 1 (p')> 


( 10 ) 


^P(bg if (r)/bxy 

a (r\ 


X%(r)lbyy 


The maximum likelihood estimator must estimate the loca- 
tion of the image based on an incomplete estimate of the 
source distribution. In other words, the estimate ? for the 
image center can be written as 


r = arg max P ({k..} \ r, / (p)) 


arg ma xF'(r) \ 


where F\r) = log [/ > ({fy}|r, /(p))] is the likelihood function, 
and we have used the fact that a logarithmic transformation 
does not affect the location of a functional maximum. 

Given the formulation of the estimator in Eq. (11), the vari- 
ance of the estimation error can again be given in terms of its 
Cramer-Rao lower bound as in [1] 


Var(|r-r 0 l)> 


- pf * p 

* IM * [pf] - 


| r \bF',r) Sf'wl) 

'I U PL, 


The expectation in Eq. (12) is, in general, very complicated 
since the detector photocounts {k^} are conditional Poisson- 
distributed random variables. Given the estimated intensity 
7(p), the actual source intensity distribution 7 Q (p) can be 
modeled as a random variable with mean /(p). That is, the 
mean photocount expected over the (ij) th pixel, X /; -, is a ran- 
dom variable with mean 
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V r > 



I(p - r)d 2 r 


(13) 



[o- 1 ] 


*/, C m 


When the source intensity is sufficiently high, or over a suffi- 
ciently long integration time, the fluctuation in the Poisson 
count statistics will be small compared to its mean. In this 
case one can approximate the detector photo count in Eq. (1 1) 
by its mean value, and the likelihood function in Eq. (1 1) can 
be reduced to 

F\r) = log [P({K..}\rJ(p))] * \n[P ({A..} I r, (\.(r)})] 

(14) 

By using this approximation, the probability given in Eq. (1 1) 
can be interpreted as the probability of receiving X /y (r 0 ) given 
the source intensity distribution {A /; -(p)}. Since it is assumed 
that / 0 (p) differs from /(p) by a zero-mean Gaussian process 
I x (p), it follows that the probability in Eq. (14) can be written 
as 



/>({X..}|r,/(p)) 


1 

vWH.bJ 



• exp 



-M'» 


Sim 




(15) 


where M is the total number of pixels used in the decision, and 
°7j\m ^ \°ijsim I denote the matrix inverse and the determi- 
nant of the covariance matrix a //Cm , respectively. The ele- 
ments of the covariance matrix can be given by 


Equations (12) and (17) together present an analytical form 
of the mean square estimator error. Given the estimated in- 
tensity pattern /(p), and the intensity correlation function 
0(p,p'), the lower bound for the variance in estimating the 
image location can be calculated. Unfortunately, for general 
distributions of /(p) and 0(p, p'), the expressions in Eqs. 
(17a-c) are very difficult to evaluate. In order to obtain some 
insight into the functional dependence of tracking error vari- 
ance and the source intensity error, some simplifications are 
required. In the following we shall present several simple cases 
that will illustrate these dependencies. 

Example 1 : White intensity noise with spectral density y 2 . 
That is, 


- <(V>- W'))> 


</ 1 (p)/ 1 (p')> = J6(p-p') 


(18) 


0(p-r,p' -r)d 2 pd 2 p' 

(16) 

By differentiating the likelihood function in Eq. (14) and 
taking the expectation values, the Cramer-Rao bound of the 
estimator error variance can be given in terms of Eq. (12) (see 
Appendix) where 


where A is the area of a pixel element, and 8(x) is the Dirac 
delta function. An example of this type of intensity uncer- 
tainty is the random dark counts from the tracking detector. 
By using Eq. (16), the correlation matrix can be calculated. 
The result is 

°ij,Qm ~ ^ 5 /m (*9) 
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where 5 /y . is the Kronecker delta. By substituting Eq. (20) into 
Eqs. (17 a-c), and using the fact that the variables do 
not depend on r , the CRLB can be reduced to 

Var(|Aw 0 l) > 


Note that even though the CRLB still depends linearly on y 2 , 
the lower bound no longer depends on the total detector SNR. 
Consequently, Eq. (22) represents an irreducible error floor 
for the estimator. For the simple test patterns shown in Fig. 1, 
the magnitude of the CRLB can be easily calculated to be 
y 2 (b + 2 a) 3 /a and y 2 n 2 a 2 /2, respectively. 
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( 20 ) 


where the total detector SNR, TV 0 , is factored out by making 
the substitution A,y (r) =TV 0 g^(r). It is easily seen from Eq. (20) 
that the variance in estimating the image location is directly 
proportional to the uncertainties in source intensity distribu- 
tion y 2 . Furthermore, the variance of the estimator error de- 
creases with increasing TV Q and, at a very high signal count, the 
variance is negligible. 


Example 2: White intensity fluctuation with spectral density 
that is proportional to the total signal intensity. In other 
words, 


y 2 N 2 . 

4>(p,p’) = ( 21 ) 

In this case the uncertainty in image brightness is proportional 
to the intensity of the image. An example of this type of in- 
tensity uncertainty is the unknown albedo variation of the 
source. An increase in the integration time at the tracking de- 
tector will only result in a proportional increase of the uncer- 
tainty. Under this condition, the CRLB reduces to 


Var(|r-r 0 l) > 


Zj \dx / Zi \ dy / 
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( 22 ) 


IV. Tracking System Using Sun-Lit Earth as 
a Pointing Reference 

In a typical spatial tracking subsystem, the transmitter 
pointing information is derived from the image location of the 
reference source. The reference source can be either an uplink 
beacon laser, or the sun-lit Earth. The reference signal is re- 
ceived by the telescope and, after spatial and frequency filter- 
ing to cut down the background noise, is focused onto the 
tracking detector. The tracking detector is usually a focal- 
plane array which spans the field-of-view (FOV) of the receiv- 
ing optics, and can be implemented using a large-format CCD. 

Deriving the angular coordinate of the reference source is 
equivalent to determining the position of its image. Since ob- 
jects with angular separation less than the resolution limits 
cannot be resolved by the receiving optics, it is generally desir- 
able to design the optics such that the pixel size on the focal 
plane array corresponds to the resolution limit of the tele- 
scope. Such a design provides maximum spatial information 
with a minimum number of pixels. For a spacecraft at a dis- 
tance of 2.5 AU using a 10-cm transmitter, the image of the 
Earth will span roughly 4-5 pixels. The required pointing 
accuracy, on the other hand, is less than 1/10 of the trans- 
mitted beamwidth. Since the angular resolution of the tele- 
scope is equivalent to the minimal divergence of the trans- 
mitted optical signal, deriving the desired pointing accuracy of 
1/10 the beamwidth is therefore equivalent to locating the 
position of the receiving station to within 1/10 of a pixel size 
based on the Earth image on the focal plane array. 

When the intensity distribution of the pointing reference is 
known, as would be the case when an uplink beacon is used, 
the performance of the receiver is given by Eq. (6). The ex- 
pected detector SNR, TV 0 , can be calculated using simple link 
analysis. By using the additional assumption that atmospheric 
scattering limits the angular divergence of the uplink beacon to 
about 20 jirad, the detector SNR can be approximated by 

N 0 * 5 X 10 2 P S D*T/Z 2 (23) 

where P s is the beacon power in watts, D R is the receiver diam- 
eter in meters, T is the integration time, and z is the link dis- 
tance in AUs. Equation (23) was derived by assuming a 532-nm 
uplink beacon, and that the losses in optics and detector are 


207 



negligible. The actual signal count can be much lower than 
that given by Eq. (23) due to these losses. 

It is easily seen from Eq. (23) that, for a tracking detector 
operating at 1 KHz using a 10-cm diameter receiver at 1 AU, 
the required signal power for a 20-dB SNR is 20 KW! Obvi- 
ously, such a high power can be very costly to achieve. And 
if higher SNR is desired, the required beacon power can be 
even higher. 

An alternative is to use the sun-lit Earth as a pointing refer- 
ence. Sunlight reflection off the Earth can provide a large 
amount of signal power at the tracking detector. In fact, a sim- 
ple calculation shows that a detector SNR higher than 10 5 can 
be easily achieved. As a result, tracking error due to Poisson- 
count statistics is negligible. Unfortunately, the albedo of the 
Earth cannot be specified precisely. Cloud-cover can alter the 
surface reflectivity significantly, and a snow-covered surface 
can reflect up to one order of magnitude more sunlight than 
an exposed terrain. To further complicate the problem, these 
conditions are changing in time so that it is almost impossible 
to derive an accurate estimate of the Earth’s albedo. Since the 
uncertainty is inherent to the brightness of the source, increas- 
ing the integration will only result in corresponding increases 
in that uncertainty. Consequently, there will be an irreducible 
error floor on the tracking system performance. 

The actual impact of albedo uncertainty on the tracking 
system performance depends, of course, on the magnitude of 
the uncertainty and its spatial correlation. In general, the 
CRLB given by Eqs. (12) and (17) is very difficult to com- 
pute. For the simple test patterns shown in Fig. 1, a minimal 
standard deviation of 0.1 pixel requires that y be less than 
0.1 \J a/(b + 2a) 2 3 and 0.045/fl, respectively. If these examples 
are representative, then we will need to know the intensity to 
within 10 percent of the true value in order to limit the point- 
ing error to within 0.1 pixel. Since the average Earth albedo 
variation is much more than 10 percent, a simple ML estimator 
cannot be expected to achieve the desired tracking accuracy. 


V. Conclusions 

Because of the large link distance involved, it is desirable 
that the optical communication package aboard a planetary 
spacecraft derive its pointing reference directly from the 
image of the sun-lit Earth on the tracking detector. Given the 
detector photocounts and the prior knowledge of the image 
intensity distribution, the maximum-likelihood spatial acqui- 
sition algorithm can be derived. The performance of the 
maximum-likelihood algorithm was analyzed by calculating 
the Cramer-Rao bound on the variance of acquisition error. It 
is shown that, when the intensity distribution of the pointing 
reference can be precisely characterized, the variance in esti- 
mating the receiver angular location decreases with increasing 
image intensity or detector exposure time. On the other hand, 
when the intensity distribution is not known in sufficient 
detail, the spatial tracking error variance will not decrease in- 
definitely with increasing exposure time. For a planetary 
spacecraft using the Sun-lit Earth as a pointing reference, the 
Earth’s albedo cannot be precisely specified because of chang- 
ing weather and ground conditions. Consequently, the ML 
algorithm which derives the pointing reference based on the 
detector photocounts cannot be expected to provide an accu- 
rate pointing reference. 

It should be noted, however, that the results presented in 

this study were derived by assuming that the receiver derives 

its pointing reference based on a single frame of the image. In 

other words, the receiver estimates the image location, r , 

based on the receiver count statistics and an assumed source 
/\ 

intensity distribution, /(p). When multiple images of the point- 
ing reference are available, the receiver can then jointly esti- 
mate the true source distribution, 7 0 (p), and the image loca- 
tion, r. For such a receiver, the tracking system performance 
will not be limited by the intensity uncertainty. In fact, with a 
sufficiently large number of look angles, one could derive an 
estimate /(p) which closely approximates / 0 (p) and, conse- 
quently, the variance in estimating r could be reduced to an 
acceptable level. 
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a 



G (r 0 ) = a (/ 3 + 2a ) 


Fig. 1. The value of G(r Q ) for two simple examples of source 
intensity distributions over the detector focal plane. The solid grids 
represent the CCD pixels, and the shaded areas represent the image 
of the pointing reference. The image intensities are assumed to be 
uniformly distributed. 




Appendix 

Derivation of Equation (16) 


The logarithm of the joint probability distributioni > ({X / }| r,I(p)), is 
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By differentiating Eq. (A-l) and using the fact that E [d^CX^I r,T(p))/bx] is equal to zero for all r, one can show that 
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In deriving Eq. (A-3), we have used the assumption that {X f/ } are Gaussian distributed with mean {X /; .}. 
By differentiating Eq. (A-l) and taking the expectation values of X (/ , Eq. (A-2) is reduced to 
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By substituting Eq. (A-3) into (A-4), it is seen that 


3 \ ^ 

— ln[P({\, y (r 0 )}|r,f(p))]| 


Similarly, it can be shown that 





(A-5) 


(A-6) 


and 




dx 



i/,fi m 



(A-7) 


211 



N89-20350 


TDA Progress Report 42-95 


July-September 1988 


A Preliminary Weather Model for Optical Communications 

Through the Atmosphere 

K. S. Shaik 

Communications Systems Research Section 


A preliminary weather model is presented for optical propagation through the atmo- 
sphere . It can be used to compute the attenuation loss due to the atmosphere for desired 
link availability statistics. The quantitative results that can be obtained from this model 
provide good estimates for the atmospheric link budget necessary for the design of an 
optical communication system. The result is extended to provide for the computation of 
joint attenuation probability for n sites with uncorrelated weather patterns. 


I. Introduction 

During the past few decades, the pioneering work of 
Tatarski [1], Fried [2,3], Hufnagel and Stanley [4], and 
Ishimaru [5] , has set the stage for the development of a con- 
sistent theory to evaluate the performance of optical commu- 
nications and imaging through the atmosphere. However, there 
is a conspicuous lack of weather models that may be used to 
obtain even a first-order estimate of statistics for the avail- 
ability of an optical communications link under ambient 
weather conditions. 


The light energy of a beam dissipates as it travels through 
the atmosphere, due to scattering and absorption. If it can be 
assumed that (1) attenuation loss is independent of radiation 
intensity and (2) the absorbing and scattering events occur 
independently of each other, the atmospheric attenuation can 
be expressed by the Bouguer-Lambert law: 


Iiy) = I Q {v) exp 



y(v,z)dz 


( 1 ) 


where I(v) is the observed irradiance at optical frequency v, 
/ 0 (v) is the irradiance that would have been observed if the 
beam were propagating in a vacuum at a distance Z from the 
source, and y(v, z ) is the total extinction coefficient due to 
scattering and absorption by the atmospheric constituents at 
position z. The magnitude of the argument of the exponential 
in the above equation is known as the optical depth or optical 
thickness, r, of the medium, i.e.. 


T 



y(v, z)dz 


( 2 ) 


Experiments [6] with artificial fog and smoke and diluted 
solutions of milk show that the Bouguer-Lambert law holds 
well for optical thickness r < 12. The optical depth can be 
simply related to the attenuation loss, L, in dB by the fol- 
lowing approximation: 


L »» 4.34r 


( 3 ) 
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The distribution of gas molecules, clouds, fog, haze, aero- 
sols, and other particulates in the transmission path influence 
the computation of attenuation loss. These phenomena at best 
can be interpreted on a statistical basis. The system designer 
may have to be content with a probability for which the atten- 
uation of the atmosphere due to ambient weather is less than 
some critical value Z dB. Hence we define the probability, 
w^Z), of the random weather variable, W , for a single site as 

Wl (L) = P W (KL) (4) 

where / is the loss variable. wq(Z) is the fraction of weather 
conditions at any given site for which the attenuated direct 
beam irradiance /(v) > / Q (v) exp [-0.23 Z] . The weather 
model developed below can be used to compute this proba- 
bility. The result is also extended to obtain the joint weather 
probability, h^(Z), that at least one of the sites has extinction 
loss / < Z forn independent sites receiving simultaneously. 


rapidly with increasing concentration of aerosols, fog, 
haze, and clouds. Optical attenuations L > 1000 dB are 
possible, where the validity of the Bouguer-Lambert 
law is questionable. Table 1 shows typical ranges of 
attenuation values for various types of clouds [8] . 
Since it is unlikely that an optical communication 
system will be designed for Z > 30 dB, the Bouguer- 
Lambert law serves as a good approximation. As shown 
in Fig. 2, the probability of opaque cloud cover over 
the southwestern U.S. is 20% [7] . Let us take a pessi- 
mistic view and assume that attenuation loss due to all 
opaque clouds is 100 dB or higher, and use this assump- 
tion to estimate one of the model parameters. 

(4) It is further assumed that the attenuation of the beam 
has an exponential distribution for L >L 0 . The hypo- 
thesis is supported by visual observations, and also by 
the experimental data collected at Goldstone in Cali- 
fornia for atmospheric propagation at 8 to 10 Ghz. 1 


II. Weather Model 

The following observations are made in order to arrive at 
the preliminary weather model: 

(1) It is assumed that for some fraction of the time p, 
where 0 < p < 1, clear weather conditions hold. This 
number can be determined approximately from exist- 
ing data on cloud cover and visibility for potential 
sites. An analysis of two years of GOES satellite data 
by Wylie and Menzel [7] shows that the probability 
of having clear weather in the southwestern U.S. is 
over 60% (Fig. 1). 

(2) The attenuation under clear weather conditions is due 
to scattering and absorption by air molecules and 
sparse particulate matter in the upper atmosphere. .The 
attenuation due to molecular absorption is approxi- 
mately 0.5 dB [6] . The attenuation due to molecular 
scattering is of the same order. Given that the atmo- 
sphere is never totally free of aerosols and thin cirrus 
clouds, on an average clear day the attenuation loss 
would be in the range of 1 to 3 dB. For simplicity, 
let us assume that the minimum attenuation loss due 
to the atmosphere is 3 dB. Hence, the model defines 
clear-air atmosphere, which occurs with probability 
p, as having an attenuation lossZ 0 = 3 dB. Note that 
this attenuation loss refers to near-zenith propagation 
through the entire atmosphere. For any zenith angle 0 , 
the path attenuation can be written as Z Q sec 6. How- 
ever, the remainder of the discussion assumes near- 
zenith propagation paths, i.e., 0=0. 

(3) The probability of not having clear weather isq = 1 -p. 
The attenuation loss of the atmosphere increases 


With the foregoing assumptions in mind, it is now possible 
to postulate a weather model. The weather cumulative distri- 
bution function (CDF) for a single site can be modeled as 

w'jCO = 1 -?exp [-0.23 *(Z. - i 0 )] {L>L Q ) 


( 5 ) 

where w^Z) is defined in Eq. (3) above, q is the probability 
fraction when the weather is not clear, b is a site-dependent 
parameter to model the slope of the CDF curve, and Z 0 is 
the minimum attenuation loss due to the atmosphere. Note 
that w x (L = L 0 ) = p . The value of b may vary with geo- 
graphical location and altitude, and can be inferred from 
observed visibility and extinction loss data for potential 
receiving sites. For the southwestern U.S. region, q = 0.4, 
and from the third item above, the attenuation loss is 100 dB 
when w^L = 100) = 0.8. Using these values in Eq. (5), we find 
that b = 0.03. This estimate of the parameter b will be used in 
numerical examples later. 


Equation (5) can be recast in a more familiar form for an 
optical communication system designer to give the attenua- 
tion loss in terms of the weather probability uq(Z), i.e., 


L = 


L o . 


Z rt + 


1 


o 0.23 b 


In 


1-VZ) 


for vv 1 (Z) <p 
for (Z)> p 


( 6 ) 


1 JPL Internal Document 810-5, Rev. D, 1988. 
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Figure 3 is a plot of attenuation loss as a function of the 
weather probability for the southwestern region of the U.S., 
for which q = 0.4, b =0.03, and L 0 = 3 dB. These estimates, 
needless to say, are quite important for the system designer to 
determine the link budget for loss due to the atmosphere for 
Earth-space optical communication paths. 

It is also possible to extend the result to n sites receiving 
simultaneously. From Eq. (4), it is easy to see that the comple- 
ment of the probability w^Z,) is given by 

P w (l>L) = w'd) = qexp[-0.23b(L-L 0 )] 

(L>L 0 ) (7) 

The joint probability of n sites, vv£(Z,), that the attenuation 
loss /• > L for all i can be written as 

<a) = „„(/,> l,i 2 > l,..., / n >D 

( 8 ) 

where the subscripts 1 , 2, . . . , n label the receiving sites. When 
the weather conditions for all sites are independent and iden- 
tically distributed (IID), we have 

w„ c (Z) = P Wi (l t > L) P W2 (l 2 P w Jl n > L ) 

= [P w U>l)] n (9) 

Using Eq. (9), the joint probability that at least one site has 
attenuation KL is found to be 


w n (L) = !-<(/,) = 1 - [q exp [-0.23 b{L-L Q )]] n 

(L>L 0 ) (10) 

For a single site with p = 0.6, L Q = 3 dB, and b = 0.03, the 
probability that the attenuation/, < 3 dB is = 3) = 0.60. 
If three such IID sites are chosen, we have w 3 (Z = 3) = 0.94. 
In other words, if a system is designed to absorb an extinction 
loss of 3 dB, a three-site receiving network will be functional 
94% of the time. Figure 4 plots the fraction of the total period 
under ambient weather conditions when the attenuation is 
< 3 dB as a function of the number of sites. Table 2 gives the 
expected dB loss for a desired link availability percentage for 
up to four joint receiving sites. It is, however, not very clear 
how the independence of weather patterns at various sites can 
be insured. It is known that the scale size of weather patterns 
is on the order of a few hundred kilometers, and this measure 
may be used to find sites with uncorrelated weather. Joint ob- 
servation of weather parameters for the probable sites will be 
necessary to make a more accurate determination. 

III. Conclusion 

The virtue of the weather model presented here lies in its 
simplicity. It may be applied with ease to obtain a first-order 
magnitude of probabilities w n (L) for an optical system that 
must operate in the atmosphere. The computation of these 
probabilities, for example, will provide a good statistical esti- 
mate of the attenuation loss to an optical communication 
system designer for link availability. 

The model does not consider frequency dependence, since 
it has been studied thoroughly and well documented in LOW- 
TRAN computer code developed at the Air Force Geophysics 
Laboratory [9] . The model also disregards seasonal variations, 
which can be incorporated later when adequate data bases 
have been developed. 
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Table 1. Typical range of values for cloud attenuation 
(adapted from [8]) 


Cloud 

type 

Cloud base 
height, 
km 

Cloud 

thickness, 

km 

Total 

vertical 

attenuation, 

dB 

Stratus 

0.1 - 0.7 

0.2 - 0.8 

26 - 350 

Stratocumulus 

0.6 - 1.5 

0.2 - 0.8 

3 - 100 

Nimbostratus 

0.1 - 1.0 

2 - 3 

260 - 1300 

Altostratus/ 

cumulus 

2-6 

0.2-2 

9 - 260 

Cumulus 

0.5 - 1.0 

0.5 -5 

22 - 870 

Cumulonimbus 

0.5 - 1.0 

2-12 

260 - 5200 

Cirriform (ice) 

6-10 

1.0 -2.5 

1 - 15 

Fog 

0 

0-0.15 

0-13 


Table 2. Attenuation loss as a function of desired link 
availability for n sites receiving jointly 

Percentage 

weather 


Attenuation loss, L y dB 


n = 1 

n = 2 

n = 3 

n = 4 

60.0 

3.00 

3.00 

3.00 

3.00 

62.0 

10.34 

3.00 

3.00 

3.00 

64.0 

18.07 

3.00 

3.00 

3.00 

66.0 

26.25 

3.00 

3.00 

3.00 

68.0 

34.92 

3.00 

3.00 

3.00 

70.0 

44.16 

3.00 

3.00 

3.00 

72.0 

54.03 

3.00 

3.00 

3.00 

74.0 

64.63 

3.00 

3.00 

3.00 

76.0 

76.08 

3.00 

3.00 

3.00 

78.0 

88.53 

3.00 

3.00 

3.00 

80.0 

102.16 

3.00 

3.00 

3.00 

82.0 

117.23 

3.00 

3.00 

3.00 

84.0 

134.09 

3.00 

3.00 

3.00 

86.0 

153.19 

12.55 

3.00 

3.00 

88.0 

175.24 

23.58 

3.00 

3.00 

90.0 

201.32 

36.62 

3.00 

3.00 

92.0 

233.25 

52.58 

3.00 

3.00 

94.0 

274.40 

73.16 

6.08 

3.00 

96.0 

332.41 

102.16 

25.41 

3.00 

98.0 

431.57 

151.74 

58.47 

11.83 

99.0 

530.72 

201.32 

91.52 

36.62 
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PROBABILITY OF CLEAR SKY, PERCENT 

Fig. 1 . Contour diagram obtained from 2 years of GOES satellite data showing the probability 
of clear sky over the United States [7]. 



PROBABILITY OF OPAQUE CLOUD COVER, PERCENT 

Fig. 2. Contour diagram obtained from 2 years of GOES satellite data showing the probability of 
opaque cloud cover over the United States [7]. 
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A novel Automatic Frequency Control (AFC) loop based on an Extended Kalman 
Filter (EKF) is introduced and analyzed in detail The new scheme involves an EKF 
which operates on a modified set of data in order to track the frequency of the incoming 
signal The algorithm can also be viewed as a modification to the well known cross- 
product AFC loop . A low carrier-to-noise ratio (CNR), high-dynamic environment is used 
to test the algorithm and the probability of loss-of-lock is assessed via computer simula- 
tions. The scheme is best suited for scenarios in which the frequency error variance can 
be compromised to achieve a very low operating CNR threshold. This technique can 
easily be incorporated in the Advanced Receiver (ARX), requiring minimum software 
modifications. 


I. Introduction 

The algorithm introduced and analyzed herein is another 
application of the theory of Kalman filters [1] to the prob- 
lem of estimating the parameters of a sinusoid embedded in 
noise. The use of Kalman filters in tracking time-varying 
parameters in general and the phase and/or the frequency of a 
sinusoidal signal in particular, has been well investigated by 
many researchers and is extensively documented in the litera- 
ture [2] -[4] . In this particular application, the parameter of 
primary interest is the frequency of a pure sine wave which 
should be estimated in the presence of high dynamics at a low 
carrier-to-noise ratio (CNR). Typically, it is applied in a region 
where the Phase Locked Loop (PLL) loses lock frequently due 
to cycle slipping [5] and hence is unreliable. 


The scheme sought should be easy to implement and 
possess the same order of complexity as the PLL. The goal 
here is to be able to track the frequency at a low CNR with 
a simple scheme that trades complexity for root-mean-squared 
(rms) frequency error performance. The automatic frequency 
control (AFC) algorithms discussed in the literature are 
typically more complex than the PLL [6] , [7] except for the 
cross-product AFC loop, which performs poorly in terms of 
lowest operating CNR. In a typical operating environment, if 
the PLL loses lock due to a severe maneuver by the trans- 
mitter, the proposed simple AFC algorithm could then be 
used to track the doppler until the dynamics are well within 
the tracking capability of the PLL, which can then track 
again with the frequency initialization provided by the AFC 
algorithm. 
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II. Description and Analysis of Algorithm 

One approach to tracking the frequency of a sinusoidal sig- 
nal at a lower CNR than the PLL lies in modeling the dynamic 
process in differential form and hence tracking only the differ- 
ential dynamics. Thus in this model, the phase process can not 
be tracked directly and is really compromised in order to fur- 
ther lower the threshold at which the frequency process can be 
estimated. However, given its initial value, the phase can later 
be derived from the frequency by integration, assuming the 
latter is estimated with a suitably small rms error. 

There are numerous ways to arrive at a differential signal 
model to track the frequency; for example, each of the in- 
phase and quadrature samples at a specific time can be ex- 
panded using a Taylor series as a linear combination of the two 
previous samples [8] and that, after some further processing, 
can lead to a signal model in the desired form. The disadvan- 
tage of this scheme is that it tends to be quite computationally 
demanding because it considers higher-order approximations 
to the measurement function. In this article, the phase infor- 
mation is removed by manipulating the samples in a more 
direct manner, i.e., a simple cross-product between the in-phase 
and quadrature samples is performed in a manner similar to the 
cross-product AFC loop. The modified samples are then fed 
into an Extended Kalman Filter (EKF) that tracks the differ- 
ential phase change from which the doppler can be deduced. 

In the derivation to follow, the vector notation which is 
standard in the theory of Kalman filters is adopted. The re- 
ceived signal is observed in the presence of additive narrow- 
band white gaussian noise with one-sided power spectral den- 
sity N Q (watts/Hz). The carrier is first removed by mixing the 
observed waveform with a fixed reference, after which the in- 
phase and quadrature signals are sampled at a fixed rate f s = 
77 1 where T s is the sampling time. Hence the samples that 
need to be processed can be expressed in vector form as 


r{k) = 


’>}(*)' 


A sin 6(k ) 

N k \ 


A cos 6(k) 


+ n(k) 


0 ) 


where ^ denotes the 2 X 2 identity matrix. Typically, the re- 
ceived samples rj(k ), r^(A:) form the input to an EKF with a 
state vector consisting of the phase d(k) and its various deriva- 
tives since the latter is modeled as a polynomial in time of 
sufficient degree to account for the dynamics encountered. 
The advantage of this approach is that it tracks the phase and 
frequency with small rms error when operating above thresh- 
old, defined here as the CNR at which frequency loss-of-lock 
occurs with a probability of 0.1. Because frequency is the pri- 
mary parameter in this application, we expect to further lower 
the operating threshold by compromising the phase estimates 
and the rms error performance. The block diagram of the new 
scheme, referred to as Frequency EKF (FEKF), is shown in 
Fig. 1. The cross-product is performed in order to remove the 
phase from the samples, as follows 


= r^r^k - 1 )~r Q {k)r I {k - 1) 

(3) 

2 Q (k) = rj(k) rj(k - l) + r Q (k) r Q (k - 


In doing so, the effective noise has been increased and this will 
of course result in a larger frequency error. The modified sam- 
ples Zj(k) and Zg(k) are then used in the EKF which tracks 
only the differential phase, not the pseudo-phase. This results 
in a reduction in the order of the EKF and hence in complex- 
ity. Defining CNR as the carrier power to the one-sided power 
spectral density level of the noise, we have 



(4) 


From Eq. (2), N Q is equal to 2T s o 2 where o 2 is the variance 
of the noise samples n f (k), n Q (k ). Hence, CNR can be ex- 
pressed in terms of o 2 as 


CNR = 


2 ry 


(5) 


where k is the discrete time, d(k) the phase of the received 
signal, A its amplitude and n T (k) - [nj(k) n Q (k)] is a zero 
mean gaussian vector, the subscripts / and Q denoting the in- 
phase and quadrature components respectively. Hence, 


Without loss in generality, we can now set A to unity, as the 
CNR and the sampled noise variance are inversely related by 
Eq. (5). Plugging Eq. (1) into Eq. (3) and expanding, we easily 
obtain 


E[n(k)] = 0 


E[n(k)n T (k)] 



( 2 ) 


Zj(k) = sin A 6{k) + n' J (k) 

( 6 ) 


z Q (k) = cos A d(k)+n'g(k) 
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where the effective noises rij(k ), n^k) are given by 

n^k) = n f (k - 1) sin 6 (k) + n Q (k) cos 6 ( k - 1) 

+ « 7 (k - 1 )n Q {k)-n Q (k - 1) cos d(k) 

- n f (k) sin d(k - 1) - « 7 (k) n Q (k - 1) 

rig(k) = rig(k - 1) sin 6(k) + n Q (k) sin 0 ( k - 1) 

+ n Q (k) n Q (k-\) + n Q (k-\) cos 6(k) 

+ n f (k) cos 6(k - 1) + n^k) n^k - 1) 

and the differential phase at time k is defined by 

A6(k) = e(k)-0(k- 1) (8) 

The differential phase itself can be modeled as an nth order 
polynomial whose derivatives constitute the components of 
the state vector x(k) of the FEKF as follows 


In the above, J(t ) stands for jerk and denotes the second 
derivative of the differential phase or the third derivative of 
the pseudo-phase. Assuming that the jerk is a zero-mean white 
process with one-sided spectral level Nj, we obtain 

E[vlm = =o)T] (13) 

(7) where oj denotes the variance of the sampled version of J(t). 
Denoting by Q the covariance matrix of v.(k), it is easily shown 
[3] that 



i 

CN 




s s 


1 T 


3 2 


s 

Q = °j t s 

T 

$ = 



i 


0 1 


. 2 




The EKF equations are derived in [1] and are repeated below 
for convenience in recursive form. 

x(k + l/k) = x(k/k- 1) + K(k) [z(k)-h(x(k/k-l))] 


A6(k) = i r x(k) ; = [1,0, ... ,0] 

x(k + 1) = 


(9) 


where is a disturbance term that models the random 
changes in parameters due to dynamics, and $ denotes the 
state transition matrix. In the remainder of the paper, we will 
concentrate on the second-order FEKF, but the analysis can be 
generalized to any order. For a second-order FEKF, the state 
x T (k) becomes 


x T (k ) = [A0(fc) Aco(A:)] (10) 


where Aco(k) is the derivative of Ad(k). In that case, Eq. (9) 
yields 

A0(k + 1) - A0(k)+T AO^ + v^k) 

(ID 

Aco(k+l) = Ac o(k) + v 2 (k) 


where 


r kT s t 2 — / 

>»/(*) = -n~lv J ^ dT ; /=1 > 2 

J(k-l)T s K h 

( 12 ) 


(15a) 

K(k) = $ £(*/* - 1 )H{k) [^(k)^(klk - \)H{k) + 

(15b) 


Y (* + */*) = a2 £ !>/* - - 2 >/* - l M k ) 


• (H T (k)Y( k / k - op ) + *) _1 

•H T (k) 2 (*/*- l)j£ r + G (15c) 

where 


*(£(*)) = 


sin (£5 *W) 
cos (£ r x(ft)) 


H T (k) = ~h(x) | x= 


\x=7(k/k- 1) 


cosA0(k/k-l) 0 
- sin AO {k/k - 1) 0 


(16) 


221 



and R is the covariance matrix of the noise vector g(k) given 
by 


where H(z ) denotes the closed loop transfer function, related 
to the loop filter F(z) via 


R = o*l ; o* = 2 (a 2 +ct 4 ) (17) 


The weighting coefficient a is typically used to adjust the clas- 
sical trade-off between adaptation time in transient situations 
and steady-state error. The noise sequences n'j(k), n^k) are 
colored but Eq. (17) reflects the statistics only at a specific 
time. No attempt is taken to whiten the sequences in order to 
abide with the original goal of a simple scheme. The weighting 
coefficient a can be used in addition to o j to control the 
effective bandwidth of the EKF, as will be shown later in the 
linear analysis. 


The performance of the FEKF operating in the presence 
of noise can only be derived in the steady -state, in which case 
the matrix 2(k + \jk) is independent of k. From Eq. (15c), 
it is not clear that the FEKF will reach steady state because 
l,(k + 1 jk) depends on the matrix#(/r) which in turn depends 
on the predicted value A d(k/k - 1). However, it is shown in 
the Appendix that the right-hand side of Eq. (15c) is in fact 
independent of &§{k/k ~ 1), and that both the linear and non- 
linear filters reach the identical steady state. Furthermore, an 
equivalent steady-state nonlinear model is derived in the 
Appendix which is used to compute the error variance in white 
noise. Defining the steady-state matrix Z to be 


y> lim y](/c + 1 Ik) = 

^ k-+°° ^ 



(18) 


the equivalent linear loop model is shown in Fig. 2 where 
n eq (k) denotes the equivalent noise given by 

n eq (k) = n’jik) cos (k) - n^k) sin A$(k) 


(19a) 


with power spectrum (assuming <p(k ) = 0) 


H{z) 4 


F(z) 

l+F(z) 


z{o]+Tp)- of 


z2 (°i + tf) + z ( T s P- a 


■2(7*) + °n 


( 21 ) 


The closed loop transfer function is depicted in Fig. 3 for a 
fixed a (1 .005), Nj (200), and 7^(2 msec). The corresponding 
loop bandwidth B L defined in the Appendix is shown in Fig. 4 
versus the ratio a?/a 2 . When the measurement noise is domi- 
nant (i.e., oj/o„ « 1), the bandwidth is independent of CNR 
and is controlled by a. On the other hand, when the loop is 
dominated by the process noise (i.e., oj/o 2 » 1), the band- 
width is a function of both CNR and a. The performance in 
the absence of dynamics is shown in Fig. 5 as a function of 
CNR when T s = 2 msec. The theory and simulation are in agree- 
ment as long as the loop signal-to -noise ratio (SNR) is “high.” 
For low loop SNR, the loop is nonlinear and the performance 
is degraded. Note from Fig. 6 that the noise power spectrum is 
not always white in the loop bandwidth, hence Eq. (20) can 
not be simplified any further in general. 


III. Performance in a Dynamic Environment 

The performance of the FEKF in a dynamic environment 
can only be assessed through simulations. In order to compare 
the performance with other schemes, the FEKF was tested in 
the presence of the identical dynamics described in [9] , which 
exhibit two 100 g/sec jerks lasting for 0.5 sec each. For a 
fixed sampling time of 2 msec, the best performance in terms 
of lowest achievable CNR threshold is obtained when a is 
1.005 and Nj is equal to 300. This corresponds to 22.5 dB-Hz 
(Fig. 7) with an rms frequency error 41 .2 Hz (Fig. 8). The loop 
bandwidth at threshold is about 7.1 Hz with a 35.2 Hz steady- 
state error due to the jerk. The contribution of the noise in the 
linear model is about 27.3 Hz. Note that for fixed a and Nj , 
the loop bandwidth is a function of CNR as mentioned earlier. 


S n (z) = -o 2 z + 2(a 2 + a 4 ) - a 2 z 1 (19b) 

Letting <p(k) = A 6(k) - A 6(k) denote the differential error 
phase at time k y it is shown in the Appendix that the error 
variance is given by 



Compared with other frequency tracking schemes using the 
same trajectory, the threshold of the FEKF is 3.5 dB lower 
than the PLL, 2.2 dB better than the CPAFC loop, 1.5 dB 
better than a fourth-order EKF tracking phase, and finally 
0.5 dB more efficient than the approximate Maximum Likeli- 
hood (ML) scheme described in [9] . However in terms of rms 
frequency error, the FEKF is worse than all the above loops 
except for the CPAFC loop which exhibits an inferior perfor- 
mance. In terms of complexity, the FEKF requires the same 
number of computations per update as the PLL when imple- 
mented in the steady state. 
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IV. Conclusion 

A new AFC loop was introduced and analyzed. The heart 
of the loop involves an EKF which operates on a modified 
set of data in order to track the frequency. The scheme can 
also be viewed as a modification of the well known cross- 
product AFC loop. 

A detailed analysis of the second-order loop was presented 
and verified via simulations. The parameters of the FEKF were 


related to tracking parameters such as loop bandwidth and fre- 
quency jitter due to noise. The steady-state error due to jerk 
was also assessed. 

The algorithm is best suited for scenarios in which fre- 
quency error is of secondary value and lowest operating CNR 
threshold is of primary concern. This technique is easily imple- 
mented and requires a minimum amount of computations per 
update. Moreover, it is highly suited for the ARX as it requires 
minimum software changes. 
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Fig. 1. General block diagram of the FEKF. 
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Appendix 

Performance of the FEKF in the Steady State 


The error covariance matrix 2(fc + l/k) satisfies the recur- 
sive Eq. (15c) which seems to depend on A@(k/k - 1). Defin- 
ing the matrix 


Q(k) = H T (k)Z(k/k - \)H{k)+R 


(A-l) 


and replacing ]±(k/k - 1) by its steady-state value given by 
Eq. (18), we have (letting x denote A d{kfk - 1)) 




cosx 0 
-sin* 0 


<7? P 



> 

0 

+ 

n 


0 

1 

V 


o? cos 2 x + a 2 -a 2 cos x sin x 

1 n 1 


-a? cos* sinx a? sin 2 x + a 2 

l in 


Inverting £2(fc), we obtain 
1 


Q (k) 


o 2 a 2 + a 4 

n 1 n 


(A-2) 


a 2 sin 2 x + a 2 a? sinx cosx 

1 n 1 


a 2 cos x sin x a 2 cos 2 x + a 2 


(A-3) 


which when pre- and post-multiplied by H(k) gives 


would have resulted in the same matrix as in Eq. (A4) and 
hence in the identical matrix 2. 

In the steady state, the measurement vector z(k ) is sub- 
tracted from the prediction h( •) before being multiplied by 
the gain matri *K(k) as in Eq. (15a). Defining ^(k) to be 


L(k) = OPg' 1 ^) 
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we obtain in the steady state (x = A6(k/k - 1)) 
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Equation (A-7) combined with Eq. (15a) suggests the structure 
depicted in Fig. A-l. The output of the frequency disciminator 
d ( k ) is given by 


which is independent of A 6(k/k - 1). The steady-state matrix 
2 can be computed in closed form only when a is equal to one 
[10], otherwise the solution can only be obtained numeri- 
cally. One approach is to run the FEKF until it reaches steady 
state and the resulting matrix is a solution of Eq. (15c) because 
of the properties of Kalman filters. Note also that the linear 
measurement (i.e., z(k) = A 9(k) + n’(k)) with H T = [1 0] 


d{k) = z^k) sin A0 (k/k - 1) - z^(k) cos A d(k/k - 1) 

(A-8) 


Using Eq. (6), d (j k ) simplifies to 

d{k) = sin <t>(k) + n eq (k) (A-9) 
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where q(k) denotes the differential phase error and 

n eq (k) ~ n j(k) cos A6(k/k - 1) - n q (k) sin M(k/k - 1) 

(A- 10) 


It is straightforward to show that 


where H(z ) is the closed loop transfer function given by 
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From classical digital phase locked loop analysis, it is straight- 
forward to show that the error in the absence of dynamics is 
given by 


which, assuming zero error, results in the following power 
spectrum 
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while the steady-state error due to jerk (in units of m/sec 3 ) is 


The resulting nonlinear model is shown in Fig. A-2. Using z- 
transforms, it is straightforward to show that 
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where co f * is the radian frequency of the incoming signal and c 
the velocity of light (in m/sec). The one-sided closed loop 
bandwidth B L {Hz) defined by 


where Z(z) and W(z) are the ,z -transforms of x(k/k - 1) and 
co(k), respectively. The loop filter F(z) is then easily derived 
and is equal to 


F(z) = [1 0] 
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Figure A-3 depicts the simplified nonlinear model in terms of 
the loop filter F(z). Approximating sin (<; p(k )) by 0(fc ) for 
small error, we have, using operator notation 


0(z) = [1 -tf(z)] A d(z) -H(z) {n eq (k)} (A-15) 
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can be computed in closed form using the results found in 
[11] to give 
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This article describes a data collection system installed on the 400-kilowatt X-band 
transmitter of the Goldstone Solar System Radar . The data collection system is built 
around off-the-shelf IEEE 488 instrumentation , linked with fiber optics , controlled by an 
inexpensive computer, and uses software written in the Ada language . The speed and 
accuracy of the system is discussed, along with programming techniques used for both 
data collection and reduction. 


I. Introduction 

The system described in this article is the result of two 
separate goals. The first goal was to instrument the Goldstone 
Solar System Radar (GSSR) transmitter to start building a 
base of operating data that could be used for statistical analy- 
sis. The second goal was to demonstrate the feasibility of a 
data collection system that used only readily available, easily 
calibrated instruments. This article is concerned with the 
extent to which this second goal has been realized. 

The Ada language was chosen for a number of features: 
(1) multitasking is provided within the language, (2) it embod- 
ies a number of software engineering disciplines such as modu- 
larity, strong typing, and levels of abstraction, which will 
result in robust and more maintainable programs, and (3) the 
federal government maintains a standard for the language, and 
enforces conformity to the standard, which will contribute to 
maintainability of Ada programs. 

II. Hardware Description 

Figure 1 is a block diagram of the entire system, as installed 
at DSS 14. The control computer and one data collection unit 


are located in Room 105. This data collection unit has been 
wired into the Local Control Console (LCC) to provide access 
to 26 analog values, 53 interlocks, 40 indicators, and 9 warn- 
ings. The second data collection unit is located in Module II, 
just below the tri-cone area, and connected to Room 105 by a 
fiber-optic link. This unit monitors 18 resistive temperature 
detectors (RTDs), 4 turbine flow meters, and 8 paddle-wheel 
flow meters, all of which were added in the radar feedcone 
during the 70-meter upgrade period. 

A. Control Computer 

An IBM industrial AT computer was chosen as the con- 
troller. This computer is similar to the standard AT, but is 
rack-mounted and provided with an air filter and vibration 
damping mounts for expansion cards. It has an IEEE 488 
interface card, which allows a wide range of instruments to be 
monitored. 

B. Data Collection Units 

The HP 3852S Data Collection Unit is a rack-mounted card 
case, with local programming capability and a wide range of 
plug-in cards. The cards used in this system include a 514-digit 
voltmeter (option 44701A), a 5-channel counter (option 
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4471 5 A), and several multiplexer cards. Reference 1 contains 
a summary of cards available, and their capabilities. 

C. Fiber Optic 

The data collection unit in the cone area is linked to the 
pedestal by a pair of HP37204A fiber-optic bus extenders. 
These units are completely transparent to the control com- 
puter, and use internal protocols to ensure error-free data 
transmission. The maximum data transmission rate of 60,000 
bytes per second is adequate for this application. 

III. Software Description 

Figure 2 is a “Booch” diagram of the overall structure of 
the controlling software at a high level of abstraction. This 
style of diagram is introduced in [2] , which is an excellent 
introduction to Ada. The central unit is the HARDWARE_ 
DEFINITIONS package, which contains most of the informa- 
tion specific to the transmitter. This package provides a list 
of all the parameters available (an ENUMERATION type), 
functions that return the current value and status of each 
parameter, and an internal task that periodically reads the 
data collection units. Figure 3 is the declaration of the type 
PARAMETER, with explanatory comments. The names chosen 
for the parameters are abbreviations descriptive of the func- 
tion. For example, Filament Time Delay on klystron 1 becomes 
FILTDl (an interlock) and Alidade Heat Exchanger On 
becomes ALIHEON (a warning because the main heat 
exchanger should be adequate). Some other abbreviations 
used are UC for Under Current, CB for Crow Bar, IGN for 
Ignatron, IL for Interlock, RPA for Reflected Power Amplifier, 
COLL for Collector, V for Voltage, I for Current, and UA for 
Microamps. 

The GSSR program contains the operator interface. It con- 
tains three display tasks, each of which can be directed to dis- 
play a different view of the data, and a keyboard watching 
task that switches the views based on operator inputs. The 
DATA LOG package monitors the measured data and writes 
it to disk. 

A. Utility Packages 

The SCREENIO package provides facilities for multiple 
screen windows, each with its own attributes. These windows 
are used by the display tasks to present independent views of 
the measured data. It also provides functions to monitor the 
keyboard. Although they are not used in this application, it 
provides control over the screen modes and graphics capability 
for both color graphics and enhanced graphics adapters. 

The IEEE488 package provides facilities for sending and 
receiving ASCII or binary data, sending bus commands, serial 


or parallel polling, and detecting service requests. It contains 
an internal task for resource control. This is needed in a multi- 
tasking environment to ensure that different tasks cannot send 
conflicting messages. 

B. Update 

The UPDATE package encapsulates all the information 
about the data collection unit configurations. Several tech- 
niques are used to minimize the number of transactions on the 
IEEE 488 bus. First is the use of downloaded subroutines in 
the scanners. On startup, this package loads each scanner with 
a program that will take all the required measurements. This 
allows a set of measurements to be taken with the command 
“CALL DOMEAS” rather than a repetition of what measure- 
ments are to be taken. The second technique is the use of 
block transfers in binary rather than individual data values in 
ASCII. This reduces the number of bytes sent by about a 
factor of 3. A third technique is preprocessing the interlocks 
and indicators (which are either 0 or 28 volts) and sending 
only the numbers of the channels that exceed a threshold 
(2 bytes) rather than the actual values (8 bytes). Fourth is 
overlapping the measurement time of one scanner with the 
data transfer time of the other scanner. 

C. Calculated Parameters 

In addition to the parameters measured directly, there are 
a number of calculated ones. Thermal techniques as described 
in [3] are used to calculate the power in the water loads on 
each of the two klystrons and the waster load on the four-port 
power combiner. In addition, a 4-foot section of the wave- 
guide between the power combiner and the feedhorn has been 
calibrated and instrumented for a thermal determination of 
the total power being delivered to the antenna. The time 
remaining in the present cycle (transmit or receive) is calculated 
from the round-trip light time (entered by the operator) and 
the time of the last change in the beam status. This module 
also determines the correct scale factor for the vacuum-ion- 
pump current on each klystron from the range indicators. 

D. Data Log 

The DATALOG package contains a set of limits on each 
parameter, and a task that records all analog parameters when- 
ever one or more of the limits is exceeded. 

When the program is started, the operator is prompted for 
the name of the target, the round trip light time (used to cal- 
culate the TIMELEFT parameter) and a file name for the 
data. The file name is passed to the LOG task, which creates 
the file and writes a header consisting of the data, the target, 
an optional smoothing factor, and the names of the parameters 
that will be recorded. The LOG task waits until all parameters 
have been measured, then writes their initial values to the data 
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file. One of the parameters recorded is the measurement time, 
which time tags the data record. The format used for record- 
ing is Comma Separated Values (CSV) that is, the ASCII 
representation of the values, with commas separating them. 

The decision on when to record data is based on a “record 
on change” algorithm. The DATA LOG package contains a 
limit on the absolute value of change for each measured 
parameter. Every time the data is updated, the LOG task com- 
pares the change in each parameter from the last value recorded. 
If the change on any parameter exceeds the limit for that 
parameter, it sets a flag. If no parameter has changed more 
than its limit, the data is kept in temporary storage. When a 
change does occur, two sets of data are recorded: the last 
measurement that did not show a change and the measurement 
that did show a change. This technique simplifies the plotting 
of the data. 

IV. Data Reduction 

Supercalc 4 is used for data reduction. One of its options is 
reading CSV data files, and it allows keyboard macros that can 
import data, set the scales and labels, and plot the data with a 
few keystrokes. Figure 4 is a graph of the transmitter output 
power during a recent experiment. This graph plots the total 
output power (P TOT), along with output powers of each 
klystron (POUT 1 and P OUT 2). Since all operating 
parameters are recorded, additional graphs, such as beam volt- 
age and drive power, can be produced to determine the source 
of the variations in the output power. The output power can 
also be supplied to the scientists for calculation of target 
albedo or cross section. 

V. Results and Discussion 

A. Speed and Accuracy 

The data collection system reads four turbine flow meters 
with 1 -hertz accuracy. The actual frequencies range from about 
900 hertz to 1200 hertz, so this is about 0.1 percent of the 
actual reading. There are 18 RTD temperature sensors, which 


are read to about 0.1°C accuracy at a rate of 25 readings per 
second. The system measures 32 dc voltages with 5%-digit 
precision (100 readings per second) and 120 dc voltages with 
3%-digit precision (160 readings per second). Using the over- 
lapping techniques described above, it makes a measurement 
of every parameter every 3.5 seconds. 

The major limit on the time for a measurement cycle is the 
integration time of the precision voltmeter. Because one scan- 
ner is transmitting its data while the other is making measure- 
ments, an increase in the data transfer rate would not decrease 
the cycle time. The counter card needs 1 second of integration 
time to get 1 -hertz accuracy, but it reads all channels simul- 
taneously, and is independent of the voltmeter, so it is not 
limiting the overall speed. 

If necessary, some speed improvement is possible in the 
future by using a higher-speed voltmeter (option 44702A) for 
some of the less critical measurements. This unit has only 
12 bits of resolution (plus sign) and a maximum input voltage 
of 10 volts, but can read up to 100,000 channels per second. 

B. Problems 

During the initial phases of the development, efforts were 
made to allow the system to function with bad or missing 
sensors by marking individual data items as “UNKNOWN,” 
but measuring and recording the rest of the data. This effort 
was only partially successful. At present, it can detect and 
allow for problems in the RTD temperature sensors and fail- 
ures of an entire data collection unit. A problem at the card 
level within a data collection unit prevents reading of other 
cards, and a failure of the fiber-optic link prevents reading of 
the local data collection unit. 

C. Open Items 

Not yet implemented in the system are: (1) better tech- 
niques of detecting and flagging hardware failures, (2) more 
precise limits on parameters used in the “record on change” 
algorithm, (3) other tools for data reduction, and (4) real-time 
graphical representation of certain parameters. 
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ORIGINAL PAGE IS 
OF POOR QUALITY 


type PARAMETER is ( 

— analog parameters 

— Klystron 1, upstairs system 

LOAD_FLOW_l , LOAD_TIN_l , LOAD_TOUT_l , LOAD_TURB_l , 

COLL_FLOW_l, COLLJTIN_l, COLL_TOUT_l , 

BODY_FLOW_1 , BODY_TIN_l , B0DY__T0UT_1 , 

MAG_FLOW_l, WASTE_TOUT_l, 

— Downstairs system 

COLL_I_l, FIL_I__1 , FIL_V_1, MAG_I_1 , 

P_°UT_1, P_DRIV_1, P_REFL_1, VAC_I_1, 

RPA_ONE_l , RPA_TWO_l, 

— calculated parameters 
P_L0AD_1 , P_WASTE_1 , 

— Klystron 2, upstairs system 

LOAD_FLOW_2, L0AD_TIN_2 , LOAD_TOUT_2 , LOAD_TURB__2 , 

COLL_FLOW_2, C0LL_TIN_2, COLL_TOUT_2 , 

BODY_FLOW_2 , BODY JT IN_2 , BODY_TOUT_2 , 

MAG_FLOW_2 , WASTE_T0UT_2 , 

— Downstairs system 

COLL_I_2 , FIL_I_2, FIL_V_2, MAG_I_2 , 
p _ OUT _ 2 / P_DRIV_2 , P_REFL_2 , VAC_I_2 , 

RPA_ONE_2, RPA_TWO_2, 

— calculated parameters 
P_LOAD_2 , P_WASTE_2, 

— Common Parameters, Upstairs 
WG_TURB, WG_TIN , WG_TOUT, 

P_TOT_TURB, P_TOT_TIN, P_TOT_TOUT, 

— Downstairs 

BODY_I , BEAM_V, BEAM_I , CB_TIME , 

VACJV, PHASE, VDC_2 8 , P_TOT, 

— Misc 

P_WASTE, — sum of two waster loads 
TIME_LEFT, — derived from RTLT 
MEASUREMENTJTIME , 

— calculated 
PJTOT__CALC, 

— Indicators 

— Klystron 1 

UA_5_1, UA_50_1, UA_500_1, UA_5000_1, UA_50000_1, — Vac Ion Scale 
FIL_RAISE__1 , MAG_RAISE_1, DRIVE_RAISE_1 , 

FIL_L0WER_1, MAG_L0WER_1 , DRIVE_L0WER_1 , 

— Klystron 2 

UA _ 5 _ 2 / UA_50_2 , UA_500_2, UA_5000_2, UA_50000_2, — Vac Ion Scale 
FIL_RAISE_2 , MAG_RAISE_2 , DRIVE_RAISE_2 , 

FIL_LOWER_2, MAG_L0WER_2 , DRIVE_L0WER_2 , 

— common 

BEAM_READY, BEAM_ON, BEAM_RAISE, BEAM_LOWER, 

S_BAND_DSN, S_BAND_RADAR , X_BAND_RADAR, 

ANT_POS , LOAD_POS , 

HE_ON, MAIN_HE_ON, MG_ON, DRIVE_ON, 

PHASE_0, PHASE_18 0, IL__OPEN, 

PGM_MODE, COMP_IF_ON, 


Fig. 3. Ada declaration of type PARAMETER. 




— Warnings 

ALI__HE_ON, AUX_HE_ON, RESIST_IN, RESIST_OUT, 
HE_TANK_LOW , HE_TANK_ PRESS , HE_FANS / 

WG_PRESS, TR_FLOW, 

— Interlocks 

— Klystron 1 

FIL_TD_1 , FIL_UC_1 , COLL_OC_l , F0CUS_UC__1, 
REF_PWR_0NE_1 , REF_PWR_TW0_1 , REF_METER_1 , 
ARC_DET_ONE_l, ARC_DET_TWO_l , 

COLL_FLOW_LO_l , B0DY_FL0W_L0_1 , 

LOAD_FLOW_LO_l , DRIFT_FLOW__LO_l , FIL_FL0W_L0_1 , 
VAC_PWR_1 , CB_MAG_1, 

— Klystron 2 

FIL_TD_2 , FIL_UC_2 , C0LL_0C_2 , FOCUS _U C__2 , 
REF_PWR_0NE_2 , REF_PWR_TW0_2 , REF_METER_2 , 
ARC_DET_0NE_2 , ARC_DET_TWO_2 , 

COLL_FLOW_LO_2 , B0DY_FL0W_L0_2 , 

LOAD_FLOW_LO_2 , DRI FT_FL0W_L0_2 , FI L_FLOW_LO_2 , 
VAC_PWR_2 , CB_MAG_2, 

— common 

ELEVATION, PS_DOOR, PA_DOOR, CB_DOOR, 
TXR_CONFIG, MICROWAVE, CB_TEST , CB_FIRED, 
FAST_BODY , SLOW_BODY, IGN_PWR, BODY_OC, 

DC_OV, DC_OC , PHASE_FAIL, TR_OIL_LOW, 

HV_ZERO, MOTORIST, MOTOR_LO, GEN_LO , ALI_HE_OT, 

— other parameters 
OUTPUTJTO, CONFIG, 

TARGET, TIME, DATE, UNUSED, NONE) ; 


Fig. 3 (contd). 
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Fig. 4. Typical data plot. 
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The gain , phase, and pointing performance of the DSN 70-m antennas are investigated 
using theoretical antenna analysis computer programs that consider the gravity -induced 
deformation of the antenna surface and quadripod structure. The microwave effects are 
calculated for normal subreflector focusing motion and for special fixed-subreflector con- 
ditions that may be used during the Voyager 2 Neptune encounter. The frequency stabil- 
ity effects of stepwise lateral and axial subreflector motions are also described. Compari- 
sons with recently measured antenna efficiency and subreflector motion tests are pre- 
sented. A modification to the existing 70-m antenna pointing squint correction constant 
is proposed. 


I. Introduction 

With the approaching Voyager 2 Neptune encounter, sched- 
uled for August 25, 1989, it is important to characterize the 
operation of the DSN 70-m antenna network in its newly 
upgraded configuration. It was predicted that the upgrade 
would improve antenna gain by 55 percent (1.9 dB) or more, 
relative to the gain in its 64-m configuration. This has been 
accomplished and verified for the three 70-m antennas by effi- 
ciency measurements using radio sources as reference stan- 
dards. The gain/noise temperature ratio has been increased 
even more due to a decrease of system noise temperature at 
all elevation angles. A more complete description of these 
results will be reported in future TDA Progress Reports. 

Normally, antenna subreflector (SR) focusing motion is 
made to optimize the gain at all elevation angles. This subre- 


flector motion is necessary because the antenna changes shape 
and the quadripod moves (a total of three inches laterally and 
one-half inch axially) as a function of elevation angle. Because 
the best-fit focus of the deformed main reflector is generally 
not coincident with the subreflector focus (after quadripod 
movement), sub reflector movement commanded by the sub- 
reflector controller (SRC) is necessary. Although the antenna 
gain is optimized by this method, pointing is not preserved. A 
“squint correction” to the predicted elevation angle for SR 
lateral motion along the antenna Y-axis (vertical for AZ-EL 
antennas pointing at the horizon) is made by offsetting the 
predicted elevation angle as a function of subreflector posi- 
tion only (not elevation angle). Somewhat more complete 
descriptions of these antenna and subreflector motions for the 
64-m antennas are given in [1] and [2] . The study in [1] grew 
out of a problem that occurred during the Voyager 2 Uranus 
encounter wherein a mispositioned subreflector degraded the 
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gain and pointing capabilities of the DSS43 (Australia) 64-m 
antenna. A recent study [3] compares 70-m antenna perfor- 
mance predicted by both GTD (Geometrical Theory of Dif- 
fraction) calculations and traditional ray tracing methods. 

For the purposes of frequency stability of the DSS-43 
70-m antenna during the Voyager 2 Neptune closest approach 
(approximately 0700 to 0915 GMT, Earth-received time, over 
an elevation angle range of approximately 45 to 70 degrees), 
two modifications of subreflector positioning are being con- 
sidered: (1) fix the subreflector in its axial (Z) motion at a 
position corresponding to an elevation angle of approximately 
60 degrees, or (2) fix the subreflector in both its axial (Z) and 
vertical (Y) motions at a position corresponding to an eleva- 
tion angle of 60 degrees. The subreflector X-axis position 
(horizontal) does not change, as there is no net gravity compo- 
nent in that direction, and hence no net change in the loading 
as a function of elevation angle. A previous study concerning 
the phase and frequency stability of Cassegrainian antennas 
(e.g., DSS-43 in its 64-m configuration and the present DSS-42 
HA-DEC antenna) is presented in [4] . Typical maximum sub- 
reflector axial (Z) rates of motion are 0.050 in./sec, resulting 
in an RF path length change of 0.090 in./sec. (In the study 
reported in [4] it was found that the ratio of change in RF 
path length to change in subreflector axial position was 1.8. 
(An earlier study [5] determined this factor to be 1.76.) For 
X-band frequency (8420 MHz), this results in a frequency sig- 
nature (for this example) of about 0.064 Hz. This low fre- 
quency signature confuses radio science data analysis, hence 
the proposal to fix subreflector motion. The purpose of the 
study presented here is to determine the gain and phase effects 
of this nonstandard antenna configuration. It should be noted 
that the DSN 70-m antennas have somewhat different subre- 
flector velocities than those used for the 64-m antennas. The 
maximum z-rate is not typically used. Actual operational 
methods will be discussed in Section III. 

Figure 1 shows the elevation angles of the three 70-m 
antennas on the day of Voyager 2 Neptune encounter. The 
proposed tracking plan at DSS43 entails normal tracking using 
conscan and normal sub reflector Y-Z focusing until an eleva- 
tion angle of about 40-43 degrees is reached. The subreflector 
would then be moved abruptly in Z, fixed in its 60-degree 
position, and conscan would be turned off. (At the time of the 
writing of this report, the Voyager 2 radio science plan is to 
fix the subreflector motion in Z-axis only.) The spacecraft 
would be tracked in the fixed -subreflector mode up to about 
70-degree elevation (covering the period between the two 71K- 
Ring Earth Occultation events). At an elevation of about 
70 degrees, the subreflector would be returned to automatic 
mode and conscan turned on for the remainder of the pass 
except for a short interval during Triton occultation. 


The structural model of the DSN 70-m antenna surface was 
developed by JPL’s Ground Antenna and Facilities Engineer- 
ing Section (R. Levy and M. S. Katow, private communica- 
tion). For reference, the model is designated JRMFL-FX-03J/ 
70M-MAR88, and at this time (August 1988) is the most recent 
model being used to describe antenna structural deformation 
as a function of gravity loading. The subreflector controller 
(SRC) model (P. Lipsius, private communication) used in the 
calculations here is given by equations describing subreflector 
motions along the Y (lateral) and Z (axial) directions: 

Y = - 0.0369 A - 6.45 16 i? + 0.0257 
Z = -1.7270 A - 0.0732 B - 0.0982 

where 

A - sin(45°) - sin (EL) 

B = cos(45°) - cos(EL) 

EL = elevation angle 

These equations are programmed in PROMs in the SRC. Addi- 
tional SR position offsets are computed and applied in field 
operation by numerous measurements made to maximize effi- 
ciency at a particular elevation angle (D. Girdner and W. Wood, 
Goldstone Deep Space Communications Complex, private 
communication). Typically, both best antenna shape and maxi- 
mum efficiency are obtained at 45 -degree elevation, and bias 
terms (different from SR position offsets) arise from uncon- 
trollable nonzero values occurring in installation of position 
readouts, etc. All 70-m antennas have identical PROMs in the 
SRC containing identical constant terms, however operator- 
applied inputs from the Local Monitor and Control (LMC) 
console could vary from antenna to antenna as a result of dif- 
fering local conditions. Errors in panel setting, later structural 
modification, or special cases may result in changes from the 
45-degree optimum position. For the calculations performed 
here, it is assumed that perfect shape and maximum gain are 
obtained at 45 -degree elevation. For these GTD calculations 
only, the constants in the above equations describing subre- 
flector position were made equal to zero. 

It should be noted that in the GTD computer programs cur- 
rently used to calculate dual-shaped reflector performance, the 
description of subreflector and feed relative positions requires 
special attention. It is important to note that the net subre- 
flector movement in the main reflector coordinate system is a 
combination of both the quadripod movement and the subre- 
flector movement relative to the quadripod. As the feedhorn is 
referenced to the sub reflector coordinate system, yet remains 
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fixed (to a good approximation) in the main reflector coordi- 
nate system, its position relative to the “moving” subreflector 
coordinate system must also be calculated. 

II. Computed Gain Effects 

Figure 2 shows the GTD-computed gains for the three sub- 
reflector modes of operation: (1) operating normally in auto- 
matic mode, (2) Z fixed in its 60-degree elevation position, 
and (3) Y and Z fixed in their 60-degree elevation positions. 
The computed gains include only the losses due to aperture 
illumination, feed and subreflector spillover, phase error, and 
cross-polarization. Effects that are not included are waveguide 
and dichroic plate loss, quadripod blockage, VSWR, surface 
roughness, and several other items. The predicted theoretical 
operational antenna gain is about 1 dB lower than the values 
shown in Fig. 2. Table 1 shows all the components going into 
the design expectation of operational gain (at the peak gain 
elevation angle of 45 degrees). Note that the design expecta- 
tion of 74.39 dBi (71.94-percent efficiency) is 0.96 dB below 
the GTD-calculated value of 75.35 dBi (89.74-percent effi- 
ciency). The actual measured antenna efficiencies for the 70-m 
network are about 68 percent (peak, without atmosphere, at 
a 45 -degree elevation angle), corresponding to an antenna gain 
of 74.15 dBi. The quarter-dB difference between design expec- 
tation and actual performance will be investigated to test the 
validity of the values presented in Table 1 . For this study the 
absolute values of gain and efficiency are not critical to the 
phase-effect problem. It is the difference between the normal 
automatic mode of SR operation and the two fixed-SR modes 
that gives the effect import for radio science data. 

It is seen in Fig. 2 that the Z-fixed SR condition shows less 
than 0.1 -dB loss of gain over the elevation range of 45 to 70 
degrees. The actual Z-mispositioning of the subreflector re- 
mains small over that elevation angle range, compared to the 
mispositioning resulting from fixing Y also. For example, at 
45-degree elevation, the subreflector is mispositioned 0.259 
inches in Z and 1.330 inches in Y, relative to their positions at 
a 60-degree elevation angle. From previous studies, 1 the loss 
arising from the Z-mispositioning is predicted to be 

AC = 2.18(0.259) 2 dB 
= 0.146 dB 

This loss is somewhat largef than the 0.1 dB shown in Fig. 2. 


*R. Levy, “Gain Losses for Non-Optimal Antenna Subreflector Off- 
sets,’ 1 JPL IOM 3325-88-009 (internal document), February 5, 1988. 


For the case of Y and Z fixed, there also exists a misposi- 
tioning of the sub reflector in Y (at elevation angles differing 
from 60 degrees), and this mispositioning results in about 1 .5 
beamwidths of scan at 45 -degree elevation. Pointing effects 
will be discussed later. From the previously quoted study by 
R. Levy, the gain loss at 45 degrees associated with the 1 .330- 
in. mispositioning of the subreflector in Y is given by 

AC = 0.28(1 .330) 2 dB 
= 0.495 dB 

This is almost exactly the difference between the Z-flxed and 
Y/Z-flxed curves in Fig. 2. It appears then that the R. Levy 
model may overestimate the effect of the Z-mispositioning of 
the subreflector. A further comparison of the R. Levy model 
and the GTD-calculated gain losses should be made. 

For preservation of gain, it is seen that fixing the subreflec- 
tor Z-movement at its 60-degree position has minimal effect 
over the 45- to 70-degree elevation Neptune encounter period. 

III. Computed Phase Effects 

Figure 3 shows the far-field phase effect associated with 
normal SR movement. There is a total of more than 650 
degrees of phase change over the elevation range of 5 to 90 
degrees. This phase change is due almost entirely to the Z- 
movement of the subreflector. 

A model for the particular curve shown in Fig. 3 is 
phase = 814 - 718 sin(EL) 
where EL = elevation angle, degrees. 

Note that the constant term in the above equation is 
arbitrary, and could be set to zero. What is important is the 
phase variation with elevation angle. 

The net Z subreflector movement over this range is 1.048 
in. (1.504 in. commanded by the SRC and -0.456 in. from 
quadripod movement) relative to the main reflector vertex, 
and includes both the effect of the SR movement and the 
deformation of the quadripod structure. Tests carried out at 
Goldstone 2 and subsequent GTD calculations (described 
later in this report) indicate that the effect of Z subreflector 


2 R. Riggs, “Preliminary Report of the Results of the Radio Science 
Tests Conducted at DSS-14,” JPL IOM RLR-88-10 (internal docu- 
ment), May 10, 1988. 
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movement for a 70-m antenna (K-band feed in the XKR cone) 
is to change the path length by about 1.76 times the amount 
of movement. For Y subreflector movement using the same 
feedhorn, the path length factor was determined to be 0.093. 
A 1.048-in. net subreflector Z-movement thus gives 1.844 
inches of path length change, or about 474 degrees of phase at 
8420 MHz. The additional phase change is due to the deforma- 
tion of the main reflector itself. The frequency signature of 
this phase change if the SR movement were absolutely smooth 
and continuous is probably of little consequence for radio sci- 
ence investigations. For an elevation change from 45 to 50 
degrees over a time interval of 0.410 hours, the phase change is 
42.4 degrees, resulting in a frequency signature of about 0.08 
mHz. The actual Y and Z SR movements are made in abrupt 
steps, resulting in a much higher frequency signature. 

Figure 4 shows a comparison among the normal, flxed-Z, 
and fixed-Y/Z subreflector conditions. Note that it is the fix- 
ing of the SR Z-movement that predominantly gives rise to the 
much reduced phase change. The residual change of phase 
results from both the quadripod movement and main reflector 
deformation. These deformations work in opposite directions 
as far as path length changes are concerned, and hence the net 
phase change effect is small. 

In contrast to the smoothly changing phase effects experi- 
enced when both Y and Z subreflector positions are fixed, the 
actual operational phase changes occur “abruptly” when the 
subreflector position is changed in small steps. For the 70-m 
antenna, the subreflector position is updated when an error of 
more than 0.009 in. is detected in either the Y or Z positions. 
The subreflector is commanded to move until the position 
error becomes less than 0.007 in. With overshoot due to motor 
and gear system inertia, the total movement is thought to be 
0.005 to 0.006 in. (K. Nikbakht, private communication). The 
maximum sustained rate that the subreflector is capable of 
moving is 1 in. /min (0.017 in./sec) in the Y direction, and 
3.14 in./min (0.052 in./sec) in the Z direction. The subreflec- 
tor is not generally operated in this mode, however, as the 
mechanical stresses are quite large. In actuality, gradual motor 
acceleration and reduced maximum speed result in the subre- 
flector movements over the 0.005-0.006 in. range occurring in 
approximately 2 sec. The average rate resulting is thus about 
0.003 in./sec, substantially lower than the Y and Z maximum 
rates. It will be shown in Section V that for the XRO cone on 
all 70-m antennas, the GTD -computed path length change for 
Y-movement is 0.0444 times the subreflector movement and 
for Z-movement is 1.671 times the subreflector movement. 
(The XRO cone is located at the upper left cone position when 
looking into the face of the main reflector.) Thus, the average 
pathlength rates for Y and Z, respectively, are about 0.00013 
in./sec and 0.0050 in./sec. For a frequency of 8420 MHz, this 
results in Y and Z doppler frequency signatures of about 0.093 


MHz and 3.6 MHz, respectively, with a probable uncertainty of 
at least 30 percent. 

IV. Computed Pointing Effects 

Figure 5 shows the actual antenna beam pointing resulting 
from the three different subreflector positioning schemes. In 
the case of the normal automatic SR movement, it is seen that 
the beam moves upward somewhat faster than the increasing 
elevation angle. Indeed, from a 45- to 80-degree elevation 
angle, the beam has moved up an additional 142 millidegrees 
as a result of the attempt to maintain optimum gain by proper 
positioning of the subreflector. In order to maintain proper 
pointing, the applied squint correction is the negative of this 
beam movement. 

It can be seen from Fig. 5 that over the elevation range of 
45 to 70 degrees, the Z-Fixed subreflector condition results in 
negligible additional pointing error, to within the resolution of 
these calculations (1 millidegree). A simple analysis [6] , [7] 
shows that even with the asymmetric off-axis tri-cone struc- 
ture of the 70-m antennas, the 0.259-in. mispositioning of Z 
(for the Z-fixed subreflector condition) at 45 -degree elevation 
results in a pointing error of less than 0.5 millidegrees. The 
reason for this is that Z-errors are just in-out position changes 
of the subreflector along the main reflector Z-axis. Fixing Y 
results in a deviation from normal pointing. The squint correc- 
tion operates according to the Y-position of the subreflector 
relative to the quadripod structure, not from the net subre- 
flector position, which includes quadripod movement also. It 
will be shown in Section V that the current (August 1988) 
squint correction as implemented in the Antenna Controller 
Subsystem (ACS) does not accurately follow the curve shown 
in Fig. 5. It has been necessary to employ a systematic error 
correction table to correct for such discrepancies. For the sub- 
reflector to be both Y- and Z-position fixed, the squint correc- 
tion will be maintained constant over the 45- to 70- degree 
elevation range, and substantial pointing errors will result. For 
radio science purposes, it appears that fixing only Z introduces 
no effect that will seriously compromise the pointing capabili- 
ties of the antenna. The systematic pointing error correction 
developed with the SR in normal automatic mode should then 
be correct for SR Z-fixed operations. 


V. Effects of Lateral and Axial Subreflector 
Motion at a 45-degree Elevation Angle 
and Modifications to Existing Squint 
Correction 

A series of GTD calculations was made at 8420 MHz to 
assess the effects of lateral and axial subreflector motions on 
gain, phase, and pointing for both the X-band (XRO) and 
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K-band cones on the 70-m antennas. Looking into the antenna 
face with the antenna pointing at the horizon (elevation angle 
equal to 0 degrees), the K-band cone is at the bottom (6- 
o’clock, 0-degree clock angle) and the X-band cone is at the 
upper left (10-o’clock, 120-degree clock angle). Table 2 pre- 
sents the results of these calculations for the X-band cone with 
the antenna positioned at a 45 -degree elevation angle (and 
hence with a perfect surface). It is seen that for the move- 
ments shown, substantial (tenths of a dB) gain changes result. 
For Y-subreflector movements, small phase changes result, 
whereas for Z-movements, substantial phase changes occur. 
From these calculations, dimensionless “K-factors” can be 
derived linking phase change to subreflector movement (inches 
of path length change/inches of subreflector movement). Thus 
phase change (degrees) at any frequency can be determined. 
For the X-band (XRO) cone at a clock angle of 120 degrees, 
the values of K determined were: 

K y = +0.0444 

K z = -1.671 

These are consistent with the propagation sign convention, 
exp(-/kr ), in the GTD program where increasing path length 
results in more negative (or decreasing) phase. 

For the K-band cone, where Y-movements of the subre- 
flector are directly “toward” or “away” from the cone, a 
surprising result occurs. Although the subreflector movement 
is “lateral” in both cases, the value for the K-band cone turns 
out to be 

K y = -0.0878 

This value is almost exactly double the value (but of opposite 
sign) for the X-band cone. The K-band K y and K z compare 
well with the experimentally measured values of -0.093 and 
-1.76, respectively. 

For subreflector movements purely lateral (at 90 degrees) 
to the cone direction, it is found that 

K y = 0.00078 

which is deemed (for the purposes of this report) to be zero. 
(The nonzero result is undoubtedly due to roundoff error in 
the calculations.) 

It is postulated that the value of K (lateral) is proportional 
to both the distance of the feed from the main reflector axis 


and to the component of subreflector movement directly 
“toward” or “away” from the feed in question. Thus, for a 
feed located directly at the center of the main reflector, very 
small subreflector movements would result in negligible phase 
changes. Figure 6 shows the lateral K-factor as a function of 
clock angle for cones located on three subreflector focus cir- 
cles of varying diameter. The diameters increase in size from 
Rj to R 3 (Rj = 0.251^, R 2 = 0.5R 3 ). The curve with maxi- 
mum amplitude (R 3 ) is the K-factor curve for cones located 
along the 70-m antenna subreflector focus circle (radius equal 
to 108.03 cm). Phase effects determined from these curves 
may assist in future positioning of outrigger horns or addi- 
tional cones on the existing 70-m antennas. 

The existing 70-meter squint correction, as implemented in 
the Antenna Controller Subsystem (ACS), uses the subre- 
flector Y-position multiplied by a constant (0.0342 degrees/ 
inch of SR Y-axis position) to calculate the amount of correc- 
tion to the elevation angle needed to maintain the beam on 
target during a track. The existing constant as determined in 
[3] was verified in this study using similar computational 
methods. However, this constant predicts the amount of beam 
movement due to subreflector motion only; it does not 
account for additional beam movement arising from quadripod 
motion and main reflector deformation as the elevation angle 
changes. It is found from this study that another constant 
more appropriately predicts beam movement, using the SR 
Y-position as an indicator of total antenna “condition,” 
rather than as the entire cause of beam mispointing. 

Figure 7 shows the actual GTD -calculated beam-peak off- 
set as a function of normal SR Y-position. It is seen that at a 
90-degree elevation angle, more than 30 millidegrees of point- 
ing error exist. This is larger than the 70-m 3-dB beamwidth! 
From this curve a new squint correction constant may be 
found: 

K sauint = 0.04145 degrees/inch 


where the SRC Y-position (inches) is used as the indicator of 
squint correction needed. This new constant could be installed 
in the ACS as a replacement for the existing constant. (If this 
is implemented, new systematic pointing -error correction 
tables would have to be developed.) 

It should be noted that the use of this new constant during 
normal subreflector focusing tests will now result in mispoint- 
ing of the beam, as the original squint constant (0.0342) was 
really the correct one to use for conditions of subreflector 
movement only— it was not appropriate for normal tracking 
operations. 
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A new squint correction may be generated from the current 
squint correction plus a correction as a function of elevation 
angle (Fig. 8). The new squint correction as a function of SR 
Y-position (Y sr ) and elevation angle (EL) is given by 

SQUINT (Y sr , EL) = 0.0342 (Y gr ) - 0.0329 
+ 0.0468 cos (EL) 

If this new squint correction is implemented, the result will be 
a “hybrid” squint correction model using both the old con- 
stant as a function of SR position and an additional pointing 
correction as a function of elevation angle. The additional 
pointing correction will also be generated in the ACS. 


VI. Conclusion 

The extensive series of calculations described here indi- 
cate several methods by which the Voyager Radio Science 
team might avoid the confusing effects of rapidly changing 
phase in the August 1989 Neptune encounter data. It appears 
that fixing the subreflector Z-motion over the elevation range 
of 45 to 70 degrees adequately solves this problem without 
significant gain degradation. A useful result of the gain, phase, 
and pointing determinations is a possible modification to the 
existing squint correction used on the 70-m antennas. This 
modification substantially changes the value of the “squint 
constant” in the ACS and appears to more accurately model 
existing antenna pointing as a function of subreflector posi- 
tion during normal tracking operations. 
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Table 1. Design expectations for 70-m antenna with stovepipe 
feedhorn at 8420 MHz and 45-degree elevation angle 


Item 

Loss, dR 

Net gain, dBi 

100% area efficiency 


75.82 

GTD -included losses 
illumination amplitude 
illumination phase 
forward and rear spillover 
subreflector blockage 
m ^ 1 modes 
cross-polarization 

-0.47 

75.35 

Waveguide loss 

-0.07 


Dichroic plate loss 

-0.035 


VSWR 

-0.039 


Quad rip od blockage 

-0.454 


Antenna surfaces (0.7-mm ims) 
main reflector panel mfg. 
main reflector panel setting 
subreflector surface 

-0.192 


Stovepipe feed compromise 

-0.11 


Imperfect focus alignment 

-0.05 


Panel gaps 

-0.01 

74.39 

(= 71.94% efficiency) 



Table 2. Effects of lateral and axial subrefiector movement for 
X-band cone (clock angle = 120 degrees) 


Displacements 

GTD -computed 
gain, dBi 
at 8420 MHz 

GTD -computed 
phase, deg 
at 8420 MHz 

Y, inches 

Z, inches 

+1.0 

0 

75.07 

119.6 

+0.5 

0 

75.28 

114.2 

0.0 

0 

75.35 

108.5 

-0.5 

0 

75.28 

102.7 

-1.0 

0 

75.07 

97.0 

0 

+0.30 

75.24 

-20 

0 

+0.15 

75.35 

44 

0 

0.00 

75.35 

108.5 

0 

-0.15 

75.26 

173 

0 

-0.30 

75.06 
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Results: 


K y = +0.0444 
K z = -1.671 
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Fig. 1. Antenna elevation angles at Neptune Encounter, August 25, 
1989, Earth-receive time. 
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Fig. 3. 70-m antenna GTD-computed phase versus elevation 
angle, 8420 MHz, SR normal operation, arbitrary phase 
values. 
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Fig. 2. 70-m antenna GTD-computed gain versus elevation 
angle, 8420 MHz, with three different subreflector configura- 
tions. 
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Fig. 4. 70-m antenna GTD-computed phase versus elevation 
angle, 8420 MHz, for three different subrefiector configura- 
tions. 
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Fig. 5. 70-m antenna GTD-computed pointing versus elevation 
angle, 8420 MHz, for three different subreflector configurations. 



Fig. 6. Subreflector lateral movement K-factor as a function of feed 
clock-angle and radial distance from 70-m antenna main reflector 
axis. 
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Fig. 7. 70-m antenna squint correction versus SR Y-position, 
existing squint correction and needed beam correction, elevation 
angle indicated. 
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Fig. 8. 70-m antenna additional squint correction needed versus 
elevation angle. 
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This article presents the results of an investigation of ground antenna pointing errors 
which are caused by fluctuations of the receiver AGC signal due to thermal noise and a 
spinning spacecraft. Transient responses and steady-state errors and losses are estimated 
using models of the digital Conscan (conical scan) loop, the FFT, and antenna characteris- 
tics. Simulation results are given for the on-going Voyager mission and for the upcoming 
Galileo mission, which includes a spinning spacecraft. The simulation predicts a 1-sigma 
pointing error of 0.5 to 2.0 mdeg for Voyager, assuming an AGC loop SNR of 35 to 
30 dB with a scan period varying from 128 to 32 sec, respectively . This prediction is in 
agreement with the DSS 14 antenna Conscan performance of 1. 7 mdeg for 32-sec scans 
as reported in earlier studies. The simulation for Galileo predicts 1-mdeg error with a 
128-sec scan and 4-mdeg with a 32-sec scan under similar AGC conditions. 


I. Introduction 

In order to reduce the pointing error of the DSN ground 
antennas, a technique called Conscan has been successfully 
used for many years. In Conscan, angle tracking is accom- 
plished by scanning the antenna around boresight in a circular 
pattern with constant angular offset, called the scan radius. 
The basic theory of Conscan is given in [1]. In the present 
implementation of the Conscan technique, the downlink signal 
is processed by a Fast Fourier Transform (FFT) algorithm. 

In this article, the impact of downlink AGC fluctuations 
due to thermal noise and a spinning spacecraft on the pointing 
error of a ground antenna is investigated. In Section II the 
Conscan process is modeled as a digital phase -locked loop com- 
bined with the FFT algorithm. This analysis is subdivided into 
the following sections: 


A. Modeling the downlink signal 

B. Algorithm for estimating the pointing error 

C. Impact of spacecraft motion on the pointing error 

D. Impact of AGC SNR on the pointing error 

E. Conscan closed-loop model 

F. Pointing jitter and pointing loss 

G. Transient response 

Section III presents a computer simulation of the Conscan 
model. Predicted performance in terms of steady-state and 
transient responses, as well as time constants, are given as a 
function of loop gain, scan period, and signal-to -noise ratio. 
Specific topics discussed are: 
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A. Simulation model 

B. Simulation results-choice of loop gain and scan period 

C. Simulation versus actual performance for Voyager 

D. Predicted performance for Galileo 

E. Simulation results versus predicted performance 

II. Analysis 

A. Modeling of the Downlink Signal 

The Conscan technique uses AGC samples of the ground re- 
ceiver in order to estimate the ground antenna pointing angle. 
In general, the AGC samples are perturbed by several causes: 
by deliberate ground antenna scanning about its boresight, by 
thermal noise in the AGC loop, by spacecraft antenna mis- 
pointing, and by other factors such as changes in gain and 
weather conditions. 


p (t) - | ~R 2 +d* +6* + 2^(^cosco r r+^sinco r oj 1/2 

(3) 

where 

R = scan radius 

0 e = pointing error in elevation angle 
6 X = pointing error in cross-elevation angle 
co T = 27r/r, where T is the Conscan period 

For small pointing offsets, the pointing loss due to ground 
antenna offset can be approximated by the least square fitting 
of a parabola to the antenna gain pattern. In our analysis we 
let 

/,(/?) = Kf (dB) (4) 


In our analysis, we first will assume that the AGC loop 
operates at very high SNR so that the AGC fluctuations due to 
thermal noise can be neglected. The effect of thermal noise 
will be addressed later in this analysis. 

Let P c be the average carrier power reaching the ground 
receiver when the receiving antenna is perfectly pointed and 
let Q(t) be the instantaneous pointing offset of the receiving 
antenna. Then, with imperfect ground antenna pointing, the 
signal power reaching the ground receiver will be 

r(t) = P c + / r [0(f)] (dBm) (1) 


where K r is a constant. 

Combining the effects of scanning the ground antenna and 
a spinning spacecraft, the downlink AGC signal will be of the 
form 

r{t) = P c +l r [0(f)] + 2 >, cos (co , t + <p.) (5) 

In Conscan implementation, the received AGC signal 
power, r(f), is sampled every I seconds and stored in a one- 
dimensional array. Inserting Eqs. (3) and (4) into Eq. (5) and 
making T - IN and t - Ij , we obtain the downlink AGC signal 
at the sampling instants , namely, 


Where / r (*) represents the power loss due to pointing offset of 
the receiving antenna (a negative number in dB). 


r , = fc + M* 2+ 0* + O] 


When a spinning spacecraft is tracked, r(t) will experience 
periodic fluctuations of the form 


+ 2 K r R 



+ 9 sin 

e 



M 

^ A:, cos (aj.t + 4).) (2) 

1=1 

where K t represents the maximum power deviation about the 
mean (in units of dB) at the frequency co f (which will be some 
multiple of the spacecraft’s spin rate). Both K f and u> t are 
determined by the spacecraft’s antenna gain pattern and 
dynamics. 

The instantaneous pointing offset of the ground antenna, 
according to [1] , can be expressed by the following equation 


+ [£*,c°s («,//+*,)] (6) 

where the subscript / refers to the /th sample of a scan cycle, 
j = 0, . . . , A-l. Subscript i refers to the /th contribution to 
the signal. The variables <p i are random phases generated by a 
spinning spacecraft (for example, by its wobble and nutation). 
In general the spacecraft’s spin rate is faster than the scan rate 
and should not be a multiple of the later (otherwise the scan 
rate must be changed accordingly). In this case, the phases <p i 
will be approximately uniformly distributed in the {0, 27r) 
interval. P c as well as K ( (i = 1, . . . , M ), R, 6 X , and 0 e are 
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assumed to be constant during many scan periods. Equa- 
tion (6) can be written as follows 

r. = P + c. + n. (7) 

where P represents the terms contained in the first pair of 
square brackets of Eq. (6), which correspond to the dc com- 
ponent; Cj represents the terms contained in the second pair of 
square brackets, which include the signal variation produced 
by scanning the ground antenna in a circular pattern, where 
Q x and d e are the pointing errors that we want to correct; and 
rij represents the terms inside the third pair of square brackets, 
which are the signal fluctuations produced by the spinning 
spacecraft. 

B. Algorithm for Estimating the Pointing Error 

In the present implementation of the Conscan technique, 
at the end of a scan period, N AGC samples are Fourier trans- 
formed with an FFT algorithm. The antenna pointing errors 
are estimated from the first component of the FFT (see 
Fig. 1). Since the FFT is just a fast implementation of the 
Discrete Fourier Transform, we will apply the theory of DFT 
to estimate the impact of a spinning spacecraft on the Conscan 
signal. 

Let D k be the /rth component of the Discrete Fourier 
Transform operator defined by 



We rewrite this equation as follows 


Imaginary part 

! Nj(0) = 0 
KRQ e + *,(!) 
N;(k) 


for k = 0 

for k = 1 (1 1) 

otherwise 


where P equals the sum of terms in the first pair of square 
brackets in Eq. (6), whose value is of no interest in our analy- 
sis. As we see from Eqs. (10) and (11), 6 X and 6 e can be esti- 
mated from R r {\) and /? 7 (1), namely, 


KR 

d * KR 

r 


( 12 ) 


N r (1) and Nj ( 1 ) result from the modulation of the downlink 
signal produced by a spinning spacecraft. Since the above 
terms superimpose to the terms containing d x and 6 e (see 
Eqs. 10 and 11), they have the effect of an additive noise 
which corrupts the estimation of the ground antenna pointing 
error. In what follows, the impact of A^(l) and A/ ( 1 ) on Tf y 
and (T will be investigated. 

C. Impact of a Spinning Spacecraft on the Pointing 
Error 

We begin by taking the DFT on n f , the terms inside the 
third pair of square brackets in Eq. (6), namely 

D k (n) = D k cos (co.// + 0 ( .)j 

= N(k) = N R (k) + iNj(k) (13) 


D k (r) = R(k) = R^ + iR^k) 


( 9 ) where N R (k) and iN,(k) are the real and imaginary parts of 
N(k). Using Eqs. (A-21 ) and (A-23) of the Appendix we obtain 


where R R (k ) and R f (k) are the real and imaginary parts of the 
/rth component of the DFT, k = 0, 1, . . . , AM. Carrying out 
the above DFT operation on Eq. (6) we obtain 


Real part 


! F+N r (0) 
K r R0 x +N r (1) 


for k = 0 
for k = 1 
otherwise 


( 10 ) 


(2^) £^‘ sin ^0/ 

X [cos0.(C 1( . + C 2 ,.)-sin 0,.(C 3 . + C 4 .)] 

(14) 

Nj(l) ~ [ 2 ^) 2 S ' n 'fo/ 

' ’ all i 

X [cos 0.(C 3 . - C 4 .) - sin 4> j (-C u + C M )j 
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where 




(17) 


'1 i 


eos7 w 


C 2, = 


cos7 2< 

sin ^i 


/if 


Fj (/)(F.sin 7 0; .) 2 


C 3 , " 


Sin >lf 

sin 73 . 


c - sm72 ' 

4 >‘ sin7 4 . 


where the transfer functions for the real and imaginary parts 
are obtained from Eq. (16), namely 


y ot = 


n IN 


y u = tt 


/(AM) 1 

T. N 


y., = rr 


I(N- 1) 1 

~f, N 


- I 1 


nv> - 


81V 2 


(C 1 , + C ! ,) 2 *(C SI + C 4 / 


F ' (0 ■ (ik) [ (C 7'- C -' )2 +( - C .i tC ai) J ] 


(18) 


*31 



JV 


7-. = TT 
* 4 l 


JL + 1 

r. fv 


(15) 

When 0. are uniformly distributed, the evaluation of A^(l) 
and A^(l) is straightforward. It is shown in the Appendix that 
N r (1) and A^(l) are processes with zero mean, and variances 
(expressed in dB 2 ) of 

varjTyi)} = o 2 Rs = ZiK'SinyJ 2 

W / all t 

x [(c 1i . + c 2i .) 2 +( c 3 . + c 4< ) 2 ] 


Figure 2 shows the frequency response of F 2 (f) = F R (f) 
+ Fj 2 (f) y the magnitude squared of the first FFT’s component. 
Note that the transfer function of the FFT processing is peri- 
odic. The period equals /, which is the AGC sampling time 
(usually 1 sec). So, the frequencies/) (/. = cj i -/27t), which are 
multiples of 1/7, hurt the Conscan estimator the most. Minima 
of F 2 (f) are 23 dB below the maximum. Note that F 2 (f) 
has nulls at multiples of l/7\ Figure 2 also shows the ratio of 
F R (f)/F/(f) versus frequency. 

With this information about the properties of the transfer 
function of the FFT algorithm, we can select the Conscan period 
T so as to minimize the impact of unwanted frequencies, 
CO;. Ideally, we would like to have freedom in selecting both 
the AGC sampling time 7, and the Conscan period T y so that 
the unwanted frequencies will fall at the nulls or where the 
FFT’s frequency response is minimal. For example, we would 
like to select the sampling time 7 so that the following condi- 
tion is met 


var {(V/l)} = a 2 = LM X) (*.sin7 0l ) 2 

J all ; 

X t C 3,-- C 4,) 2+ (- C l 1 + C 2,) 2 ] 

(16) 

In order to gain more insight into the operation of the Con- 
scan algorithm, we can think of the FFT as digital filtering. In 
this context, we can compute the contribution to the variances 
o Rs and a* in terms of the zth modulation component of a 
spinning spacecraft and the FFT’s transfer function, namely 



where coj is the most significant component of the spin rate 
and n is an integer. 

D. Effect of AGC SNR on the Pointing Error 

Again let P c be the nominal carrier power reaching the 
ground receiver and SNR be the signal-to-noise ratio in the 
AGC loop. Then the noise variance in the AGC loop will be 
°n = Fj SNR . The instantaneous signal plus thermal noise 
power of the /th AGC sample will be 

r. = 20.0 log (y/T+V Nj ) (19) 
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where V Nj - is a zero mean Gaussian random variable with vari- 
ance o^. The standard deviation of the r ; * sample (assuming 
that only thermal noise is present) will be approximately 


a 



(dB) (20) 


It is shown in the Appendix that the variances of the real 
and imaginary parts at the output of the FFT in terms of the 
variance of the AGC thermal noise are 


oL - O’ - ^ (dB 2 ) 


Rt 


It 


( 21 ) 


The DFT algorithm (modeled here as an analog operation) 
multiplies this input by the vector 


2 nt . . 2nt 
C0S AT * sm at 


integrates from 0 to N, and divides the result by N. This opera- 
tion is represented schematically below. 


0(0 



The overall variances at the output of the FFT will be simply 
the sum of the individual variances due to spacecraft spin and 
receiver thermal noise, namely 


where 

<Kt) T = [0,(0 0 e (Ol (25) 


o 2 = ol + o 2 
R Rs Rt 


°) = a is + °n 


( 22 ) 


where o^ s and o 2 Is are given by Eq. (16) and o\ t and a), by 
Eq. (21). 


E. Conscan Closed-Loop Model 

So far we have discussed the open-loop estimation of the 
pointing errors. In order to proceed with this analysis, we 
defme the closed-loop pointing errors in cross-elevation and 
elevation as follows 


6 , x = d - T 

Y x(n + 1 ) xn xn 

(h ± Q 

Y e(n + 1 ) en en 


(23) 


where d x and 6 e are the Conscan estimates of the pointing 
offsets d x and 0 e , respectively. We will treat all of the above 
angles as continuous variables of time and model the DFT as 
an analog multiplication and integration. This approach is 
allowable because the AGC sampling time / is much smaller 
than the Conscan update time, T = NI. Being in closed-loop, 
we substitute Q x for 6 X and <p e for 6 e in the third term of 
Eq. (6) and rewrite that term in vector notation as follows: 


2 K R 


cos 


2l Tt 

A T 


sin 


2 nt 
~N~ 


- 

0,(0 

- 

0.(0 


(24) 


The factor of 1/2 results from the multiplication of a cosine 
by a cosine and a sine by a sine. Double frequency terms are 
integrated to zero. Thus, the DFT operation for a large Af can 
be modeled as an integrate-and-dump device with the follow- 
ing transfer function in the hybrid sjz- domain. 



In the feedback path of the Conscan loop, the estimated 
errors in antenna pointing are scaled down by a factor G, and 
the antenna pointing corrections, 8 X and 6 C , are obtained. This 
corresponds to the following set of difference equations 


6 = 0 , „ + G0 

xn x(n-l ) 


«,» ■ tc €„ 


(26) 


where and ^ are the estimates of <j> x and 0 e , respectively. 
The subscript n indicates the nth scan period. In Fig. 3 the 
above difference equation has the following z-domain transfer 
function (summer) 


S(z) 


goo - z 

H z ) ( 2 - 1 ) 


(27) 


Because 6 X and 6 e are modeled as continuous variables of 
time, we need to convert the discrete variables 0^.(z) and 0^(z) 
to their continuous counterparts, namely, we need a Digital-to- 
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Analog converter (D/A) in the feedback path of our loop 
model. The transfer function for the D/A is 


(1 -e- sN ) 
s 


(28) 


where e~ sN =z~ 1 . 

By combining the above elements into a block diagram. 
Fig. 3 is obtained. Now the above hybrid s/zAoop model may 
be converted to a z -domain model. Neglecting for the moment 
the noise term N(z), we see by inspection of Fig. 3 that 


X'(s) 


2 KR6(s) 

— : — -*'(*) 



(29) 


Taking the z-transform of Eq. (29) and simplifying, we obtain 


X\z) = 2KR 



X\z) 


G 

(z - 1) 


(30) 


Assuming then the noise sample A^ +1 (l) is independent of 
A^(l) for all n , it can be shown that the steady-state closed- 
loop variances of the pointing error are 


1,0% 

a 2 = (in units of R 2 ) 

X (KR) 2 

(35) 

I. o 2 

a 2 = (in units of R 2 ) 

(KR) 2 


where o| and of are the open-loop variances at the output of 
the FFT and are given by Eq. (22). 

Before moving to the next topic it should be noted that the 
transfer function H(z) of the Conscan loop, Eq. (32), has a 
single pole at z = 1 - G. The stability criterion requires that 
I 1 - G | < 1 in order for the pole to remain inside the unit 
circle. This puts an upper bound on the loop gain which has 
to be less than 2. Control theory predicts that the Conscan 
loop will have a bounded steady-state error to a ramp or 
velocity input. In practice, the velocity components of the 
pointing axis are compensated by predicts, while Conscan 
compensates for step pointing errors. 


where the asterisk denotes the z-transform operation. 

With this transformation, Fig. 3 has the equivalent block 
diagram of Fig. 4, which is entirely in the z-plane. From Fig. 4 
we see that the overall Conscan open loop transfer function is 

= Wj = (TT) (31) 

and the closed-loop transfer function is 


H(z) = 


Y(z) 


G(z) 


fl(,)> [! + <?(*)] [z + (G-l)l 


M 

_ s 


(32) 


Having obtained H{z), we now need the following integral 


= i27i £ 


H{z)H(z~ 1 )=y 


(33) 


Using Table III of [2] , it is found that 

/ = 5L_ 

1 (2 -G) 


(34) 


F. Pointing Jitter and Pointing Loss 

Equation (4) relates the ground antenna pointing loss to the 
pointing error. In order to estimate the closed-loop pointing 
loss, we substitute <t> x for 6 X and <j> e for 6 e in Eq. (3). Then 
we insert Eq. (3) into Eq. (4) and integrate over one scan 
period to obtain 


l r = K r (R 2 + 4>l + <t> 2 ) (36) 

where <t> x and <j> e are random variables with zero mean (assum- 
ing a step input) and the variance is given by Eq. (35). Invok- 
ing the central limit theorem, we can assume that the closed- 
loop pointing errors 4> x and <t> e have a Gaussian distribution. 
Being derived from quadrature processes, they are mutually 
independent. With the above assumptions, we take the ex- 
pected value of Eq. (36) and obtain the closed-loop pointing 
loss, L, namely 

* * E M 


= K 


f? 2 + E 


(0 2 )+E(0 2 ) 


= K(R 2 + o 2 + o 2 ) (dB) 


(37) 

(38) 
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MAG = 


(43) 


G. Transient Response 

Let 8 xn = 6 xl and 8 en = 8 eI for all n. Inserting Eq. (26) in 
Eq. (23) and taking the expected value, it can be shown that 



(KR) 


0 -dr 

Y xn x\ 


.(«-!) 


0 =8 
en el 


or, in vector notation 


(39) 


ANG = tan 1 


Jt/1) 

V I) 


+ 7 


(44) 


where 7 represents the relative angle offset between the AGC 
table of the antenna scan cycle. In Eq. (44), a four-quadrant 
arc tangent is computed. Finally, the estimates of the pointing 
error are computed 


t = 9/ n ~ l) 


(40) 


where 


8 x = MAG • cos (ANG) 
8 g = MAG • sin (ANG) 


(45) 


r - 1 - C (0 <r< 1) 

We now define the time constant, r, as the time in seconds 
that it takes for pointing error <f> n to decay to the value of d l /e , 
where e is the base of the natural logarithm. Solving for the 
number of Conscan periods, n- 1, necessary for Q x to decrease 
by 1 je and for the corresponding r, it can be shown that 


Hr) 


(41) 


This revised estimation algorithm for the pointing error shows 
a slight crosscoupling between 6 X and 0 e . In this simulation, in 
order to mimic more closely an actual antenna, the antenna 
pointing corrections were done gradually in an interval of eight 
to ten samples with a slight overshoot before the final point. 

Figures 6 through 9 display the results of the simulation. 
The variances of the pointing error were averaged over 550 
Conscan periods. The computer-simulated results agree very 
closely with the equations derived in this analysis. 


and 


r 


T 

Hr) 


The following values were used in the simulation: K r R 2 = 
0.1 dB for both the 64- and 70-m antennas; scan period T = 32, 
64, and 128 sec; and sampling time /= 1 sec. Conscan loop 
gain G = 0.05, 0.3, and 0.6. P c = -145 dBm, and the AGC loop 
* ^ SNR was between 20 and 50 dB. 


Figure 5 depicts typical r values for DSN antennas. 


For the Galileo spacecraft, the effect of the spin on the 
transmitted signal was modeled as follows (see Eq. 2) 


III. Computer Simulation 

A. Simulation Model 

A computer program named CONSCAN. FOR using an FFT 
subroutine was written in order to check the above analysis. In 
our simulation, values typical for the Voyager and the Galileo 
missions were used. 

In the actual implementation of the Conscan algorithm at 
the DSN stations the FFT computation can start at any time 
inside a Conscan period. In order to account in our simulation 
for this time shift between the FFT reference point and AGC 
samples, the estimates of the pointing error were computed 
with the following modification to Eq. (12) 


Z/Gcos(gj. f + 0.) = K ( a 2 (dB) (46) 

where a, the effective offset from the correct pointing of the 
spacecraft antenna, is defined as 

a(r) = £ + a 2 2 + + 20^0^ cos (co^ + 0 t ) 

+ 2a 1 a 3 cos(co 2 r + 0 2 ) 

1 1/2 

+ 2a 2 a 3 cos(u) 3 t + (p 3 ) (47) 
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Here 


D. Predicted Performance for Galileo 


= combined pointing error due to Earth-fitting error, 
spacecraft/Earth drift, and attitude determination 
error 

ol 2 = pointing error due to nutation 

a 3 = pointing error due to wobble, mechanical and electri- 
cal misalignments 

In our simulation we made K Y ~ 2a 1 a 2 = 1 dB, K 2 = 
2a^a 3 = 0.2 dB, and K 3 = 2a 2 a 3 - 0.2 dB. Other variables are 
defined as follows: c = nutation spin rate, co 2 = wobble spin 
rate, co 3 = oj 1 - c o 2 . The following values were assumed: 
wobble period T x - 19.048 sec (nominal or low spin rate); 
nutation period T 2 - 14.583 sec (T 2 1 3)\K t = -77.5 dB/ 

deg 2 (curve fit to Galileo X-band antenna gain pattern). 

B. Simulation Results — Choice of Loop Gain and 
Scan Period 

From Figs. 6 through 9 we can make the following 
observations: 

Increasing the gain G decreases the time constant and, 
hence, the lock-up time, although it increases the pointing 
fluctuations and pointing loss (Figs. 6 and 7). This fact sug- 
gests the following operational strategy: choose a large gain 
value during lock-up, and a small gain value during tracking. 
This strategy has been successfully used with the Real-Time 
Combiner of the DSN Baseband Assembly. 

Increasing the scan period decreases the pointing jitter 
(Figs. 8 and 9). This effect is especially pronounced for 
Galileo which has larger pointing jitter due to spacecraft 
spin. Increasing the scan period, however, also slows down 
the pointing correction process. 

C. Simulation Versus Actual Performance for 
Voyager 

The simulation result is comparable with the observed 
antenna Conscan performance in supporting Voyager. Figure 8 
predicts a 1-sigma pointing error of 0.5 (128-sec scan) to 1.5 
mdeg (32-sec scan) for Voyager, assuming thermal noise 
35 dB below the carrier level in a 1-Hz AGC loop. In [3] the 
Conscan performance of the DSS 14 64-m antenna in pointing 
Voyager over 5 days in 1987 is reported (see Fig. A5 in [3]), 
with statistics given in Fig. A6 for individual scans with a 
32-sec period. The standard deviation reported is 1.7 mdeg, in 
good agreement with our simulation result. 


In the case of Galileo, the Conscan performance is affected 
by both thermal noise in the receiver and the wobble and nuta- 
tion reflected in the spacecraft signal. Figure 9 indicates that 
the thermal noise is the dominant effect when the AGC loop 
SNR is lower than about 30 dB. But the wobble and nutation 
effect dominates when the AGC is higher than 30 dB. As a 
result, the pointing error can be reduced by increasing the 
AGC SNR below 30 dB, but not beyond 30 dB. 

It should be noted that the Conscan period must be chosen 
carefully in order to avoid harmonic relationship with the 
wobble and nutation processes. In case both wobble and nuta- 
tion are present, the scan period of 64 sec should be avoided 
because its frequency, 0.0156 Hz, is very close to the differ- 
ence between the wobble and nutation frequencies, 0.0162 Hz. 
Abnormal pointing errors were indeed observed in simulation. 

E. Simulation Results Versus Predicted 
Performance 

Simulation results give slightly more conservative values 
for the pointing error variances than the ones predicted by 
the closed-form solution, Eq. (35). The possible reasons are 
(a) in our simulation, the lock-up transients have partially 
effected the statistics of the pointing error; and (b) an error in 
the phase offset y in Eq. (44) was deliberately introduced in 
our simulation in order to approximate the actual pointing 
imperfections. 

The discrepancy can be illustrated with two cases. For 
Voyager, with T = 32 sec, SNR = 35 dB and G = 0.6, the 
standard deviation of the pointing error is 0.6 mdeg according 
to the closed-form solution and 1 mdeg according to the sim- 
ulation. For Galileo, with the same conditions, the standard 
deviation of the pointing error is 2 mdeg according to the 
closed-form solution and 3 mdeg according to the simulation. 


IV. Conclusion 

A model has been developed to analyze and simulate the 
performance of DSN antenna Conscan as affected by the fluc- 
tuations in the receiver AGC signal. The effects of the receiver 
thermal noise and of the spinning spacecraft were analyzed 
and simulated. The simulation results agreed well with the 
observed performance of the DSS 14 antenna supporting Voy- 
ager. The simulation results show a standard deviation of 1.5 
mdeg at X-band with 32-sec scan for a 35 -dB AGC SNR. Ob- 
served performance at DSS 14 [3] for Voyager under similar 
AGC SNR conditions was about 1.7 mdeg for individual 
scans. 
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The simulation results suggested that the Conscan loop gain 
and the scan period can be chosen to optimize the pointing 
performance. A higher gain and a shorter scan period can 
reduce pointing acquisition time at the beginning of the track. 
A lower gain and a longer scan period can reduce the standard 
deviation of pointing jitters, and, therefore, the loss, during 
track. 

The simulation result for Galileo indicated higher pointing 
jitter than Voyager because of the additional effect of space- 
craft wobble and nutation on the ground receiver AGC signal. 
The spacecraft effect is dominated by the ground receiver ther- 
mal noise effect for the AGC SNR below 30 dB and becomes 
dominant when the SNR is above 30 dB. Figure 9 predicts a 


relative constant pointing error in the latter region. This region 
is typical for Galileo support when the carrier signal level is at 
least -155 dBm. 

The Conscan pointing error for Galileo under such condi- 
tions was between 1 mdeg (128-sec scan) and 4 mdeg (32-sec 
scan). The corresponding losses versus the standard deviations 
of pointing errors are given in Fig. 10. 

It was noted that the choice of a Conscan period for Galileo 
must avoid harmonic relationships with the Galileo wobble 
and nutation frequencies. The results of the analysis indicated 
that the 64-sec scan should not be used for Galileo when both 
wobble and nutation are present. 
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ig. 7. Transient response of pointing error simulation for Galileo 
spacecraft (a typical case). 
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Fig. 8. Steady-state pointing error versus AGC SNR simulation for 
Voyager spacecraft. 
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Fig. 9. Steady-state pointing error versus AGC SNR simulation for 
Galileo spacecraft. 
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Fig. 10. Pointing loss versus standard deviation of the pointing 
error at X-band for 34-, 64-, and 70-m antennas. Nominal pointing 
loss due to Conscan = 0.1 dB. 
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Appendix A 

Derivation of the Mean Value and Variance at the Output of DFT 


Let D k be the Discrete Fourier Transform operator defined 

( ol for k = 0 


as follows 



II 

c 

(A-10) 


_ 


( 0 otherwise 


- $ E 
v f /= 0 

I 2nkj\ . . ( 2nkj\ 

C0S \ N ) 1 " in \ N j 

(A-l) 

a^(fc) = 0 for all k 

(A-l 1 ) 

= R(k) = R R (k) + iRJk) 

(A-2) 

B. Sj Periodic as Defined by Eq. (A-4) 



where R R (k) and RAk) are the real and imaginary parts of . 

. R . . 1 With K = NI T, = integer, we have 

R(k), respectively. ' ' 


We want to find the mean value and the variance of R(k) 
for the following three cases 


Scos = Scos (A-l 2) 
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(A-3) 

Using Eq. (A-l) we obtain 

S cos | 

1 2irlj \ 

L „ NI. 
so that K = — is an 

integer * 

(A-4) 


S cos 

(? ••■) 

for any nonrandom 
7} and random <p f , 
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(A-5) 



Ms 

2 


for * = a: 


where S is a random variable assumed to be constant during 
one Conscan period. And we define 

E{S} = n s and var{5} = a 2 (A -6) 

E{D*0)1 = /*(*) = H R (k) + itij(k) (A -7 a) 

E {D k (r) 2 } - n(k) 2 = a 2 (k) = o 2 (k) + of(k) (A-7b) 

A. Sj = S 

Using Eq. (A-l) it is easy to see that 


0 otherwise 
lij(k) = 0 for all k 


% for it = K 


ol(k) = 


Oj(k) = 0 


0 otherwise 
for all k 


(A-13) 


(A-l 4) 


(A-l 5) 


(A-l 6) 


(k) 


Ms 


for k = 0 

(A -8) 

otherwise 


for all k 

(A-9) 


C. Sj Periodic With Any Period Length and Phase Shift 

Let 

(A-l 7) 


S t = Scos(M) +0 . 


. (2vlj\ . , . (2irlj\ 

cos 0. cos ^-y-1 - sin <t>. sin l-y- 1 


(A-l 8) 
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D. White Noise 

Assuming that the variance of each voltage sample due to 
thermal noise is , the variance of the sum of TV samples will 
be Nol . The FFT algorithm performs the N complex sums and 
divides the result by N. Hence the total variance of the com- 
plex FFT output will be 


Since in our implementation of the DFT the output is split 
into its real and imaginary parts, the variance of each of them 


will be one-half of o 2 . 

V 
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